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PREFACE 


Work on this book Avas started when both of us were members of the 
staff of Iowa State College. At that time requests were received rather 
frequently from research workers. Some wanted advice on the conduct 
of a specific experiment: others, who had decided to use one of the more 
complex designs that have been discovered in recent years, asked for a 
plan or layout that could be followed during the experimental opera- 
tions. Although the logical principles governing the subject of experi- 
mentation are admirably expounded in Fisher’s book The Design of 
Ex'periments, these requests indicated a need for a different type of 
book, one which would describe in some detail the most useful of the 
designs that have been developed, with accompanying plans and an 
account of the expeidmental situations for which each design is mOvSt 
suitable. Such a book is directed at the experimenter and is intended 
to serve as a handbook which is consulted when a new experiment is 
under consideration. , 

Mainly on account of the war, slow progress was made. In 1944 
wo completed a mimeographed draft of which several hundred copies 
were distributed. Many helpful suggestions were made by readers. 
Of these, the most frequent was to the effect that we should include more 
material dealing with the statistical analysis of the results. In the 
mimeographed draft, our practice was to give references to worked 
examples of the analysis in cases where they could be found in the 
literature, and to present examples for only those designs whose anabasis 
was not available. To this it was objected that man^^ research workers 
did not have easy access to our references. 

This suggestion raised a difficult issue. To present a self-contained 
a<iCoimt of the anatysis of variance in all its ramificationB would make 
the book, it seemed to us, unwarrantably long and expensive. Conse- 
quently we have continued to assume that the reader has some knowl- 
edge of the principles of analysis of variance and of the computational 
methods involved. Wo have included a brief review of the l^asic theory 
and an extensive set of worked examples of the analysis for both com- 
mon and less common types of design. Although strenuous efforts 
were made to obtain a selection of examples from diverse fields of 
research, a preponderance from biology, and more particularly from 



agrictiiture, wus dictated by our own experience in those areas. On 
several occasions we rejected an example w^hich would have made an 
attractive addition to the scope because we did not feel sufficiently 
familiar with the conditions of experimentation to give a realistic account 
of the problems encountered. 

Since courses of lectures on the design of experiments are being intro- 
duced in many Golleges and universities, some teachers may be inter- 
ested in the potentialities of the book as a textbook. Several comments 
are prompted by our own experience in this connection. First, it will 
often he ne-cessary for the teacher to provide a more systematic develop- 
ment of the analysis of variance than is given here. Second, interest in 
such a course is greatly enhanced by examples from an environment 
with which the listeners are familiar, and especially by examples of 
experiments that have been conducted by some of the listeners. Thus 
the teacher is well advised to build the course around his own examples, 
using those in this book mainly as supplementary material for the 
students. Finally, a selective use of the book is in order, because it 
contains much more material than can be covered in a typical one- 
quarter course, and because some difficult topics have been dealt with 
the early parts of the book. 

We wish to express our gratitude to the many staff members at Iowa 
State College and North Carolina State College who helped us by putting 
experimental data at our disposal, by providing painstaking descrip- 
tions of their experimental techniques, or by lending their assistanc*e 
in tlie preparation of the manuscript and plans. For similar kindnesses 
in connection with certain of the examples we ai^e inde^bted to 
G. F. Potter, D. Y. Solandt, D. B. DeLiiry and J. Hunter, F. M. Wad- 
ley and C. F. Rainwater, and the Wailuku Sugar Company, Honolulu, 
home theoretical results used in Chapter 14 were developed from research 
conducted at Raleigh under a contract with the Office of Naval Research. 
Finally, our thaxiks go to George W. Snedecor, who participated in the 
original plans for this book and made a careful reading of the first draft. 


W, G. Cochran 
G, M. Cox 
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CHAPTER 1 


INTRODUCTION 

1.1 The Contribution of Statistics to Experimentation 

1.11 The Problem of Interpretation. Since statisticians do not usually 
perform experiments, their claim to attention when they write on this 
subject requires some explanation. It is true that on many important 
aspects of experimentation the statistician has no expert knowledge. 
Nevertheless, in recent years, research workers have turned increasingly 
to statisticians for help both in planning their experiments and in draiv- 
ing conclusions from the results. That this has happened is convincing 
evidence that statistics has something to contribute. 

At first, requests for assistance were nearly always concerned with 
S the int erpretation of the results . It is a common characteristic of ex- 
periments in vddely diverse fields of research that, w^en they are re- 
peated, the effects of the experimental treatments vary from trial to 
trial. This variation introduces a degree of uncertainty into any con- 
clusions that are dravm from the results. Even after a number of repeti- 
tions, or replications as they are called, the investigator still does not 
know by how much his results would be changed if the experiment were 
repeated further under the same conditions. Successive trials may be so 
discrepant in their results that it is doubtful which of two treatments 
would turn out better in the long run. 

As an illustration of this variation, data are given in table 1.1 from a 
simple experiment. The data are the times (minus 2 minutes) required 
to perform a routine statistical calculation, that of finding the sum of 
squares of 27 observations. Each sum of squai^es was computed sep- 
firately by the same person on both of two standard computing machines. 
In all, 10 different sums of squares -were worked, making 10 trials or rep- 
lications of the experiment. It will be noted that the differences in 
speed range from 17 seconds in favor of machine jB to 2 seconds in favor 
of machine A. Some experimenters may comment that the results of 
this experiment are remarkably well behaved, and exhibit nothing like 
the variation with which they have to contend. The results will, how- 
ever, serve our purpose. 
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TABLE 1.1 Time in seconds (minus 2 minutes) eequibbd fob computing 

SUMS OP SQUARES 

Machine 


Replication 

r " 

A 

B 

Dmerenoe 
(A - B) 

1 

30 

14 

16 

i2 

21 

21 

0 

a 

22 

5' 

17 

4 

22 

13 

9 

■ 5 

18 

13 

5 

6 

29 

17 

12 

7 

16 

7 

9 

8 

12 

14 

--2 

9 

23 

8 

15 

10 

23 

24 


Means 

21.6 

13.6 

8.0 


The object of the experiment is, of coui'se, to compare the speeds of 
the two machines for this calculation. More specifically, two objects 
might be stated. The first is to answer the question: is there any dif- 
ference in speed? or, to put it another way, to test the hypothesis thut 
there is no difference in speed. The second object, which is related to the 
first, is to estimate the size of the difference in speed. Almost all ex- 
periments are carried out for one or both of these purposes — the testing 
of hypotheses and the estimation of differences in the effects of different 
treatments. 

As regards the test of the hypothesis that there is no difference in 
speed, we might report, as relevant evidence, that B proved faster 7 
times out of 10, A twice, while once there was a tie. For the problem of 
eKstimation we might report that the average difference in speed in the 
experiment was 8 seconds in favor of B. These purely descriptive state- 
ments do not carry us very far. Their weakness is that they supply no 
information about the reliability of the figures presented. For example, 
have we any confidence that, if the experiment were continued for an- 
other set of 10 trials, the advantage at the end would still be close to 8 
seconds in favor of B? 

Because of the deficiencies in the descriptive approach, experimenters 
have adopted a different point of view in the summarization of their re- 
sults. They tend to reason as follows. Suppose that it were feasible to 
continue the experiment indefinitely under the same conditions. The 
average difference in speed between the two machines would presumably 
settle down to some fixed value. This value, which will be independent 
of the size of the experiment that was actually carried out, may reason- 
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ably be called the difference between and B. From this point of 
view, the problem of summarizing the results may be restated in the 
question: what can we say about the true difference between A and 5? 
This is a problem in induction from the part to the whole, or in sta- 
tistical language, from the sample to the population. A solution to this 
problem has been developed by means of the theory of statistics. It is 
this solution that constitutes the principal contribution of statistics to 
the interpretation of the results. 

1.12 Statistical Inference. Obviously, it cannot be expected that the 
solution will provide the exact value of the unknovii true difference. As 
a less ambitious goal we might hope to be able to find 2 limits within 
which the exact value is certain to lie, but even this cannot quite be at- 
tained. What can be done is that for any chosen probability, say .95, 
two limits are found such that the probability that they enclose the true 
difference is .95. In other words, limits can be found that are almost 
certain to enclose the true difference, where the degree of certainty, as 
measured by the probability, can be chosen by the experimenter. Since 
we wish to focus attention on the type of inferential statement that can 
be made rather than on the method of calculating the limits, the com- 
putations will not be discussed at present. For the example in table 1,1 
they will be found in section 4.42. ‘\^Tien the probability is .95, the limits 
for the true difference in speed between the 2 machines turn out to be 
3.3 and 12.7 seconds in favor of machine B. A statement that B is 
faster by an amount that lies between 3.3 and 12.7 seconds has a 1 in 
20 chance of being wi'ong. If the degree of certainty is decreased bj^' 
lowering the probability to .8, the limits are narrowed to 5.1 and 10,9 
seconds. If the probability is raised to .99, the limits become 1.1 and 
14.9 seconds, and as the probability is brought closer to certainty, the 
limits steadily become farther apart. The limits are called confidence 
limits, and the probabilities are called confidence probabilities.* 

These probabilities are not merely academic abstractions : they can 
be subjected to at least a rough experimental verification. For verifi- 
cation, we need a situation where the true difference between the effects 
of two treatments is known. In toxicology, for instance, this situation 
can sometimes be obtained by diluting a standard poison to a known 
extent. The dilution is sent to the laboratory, labelled as an ^hinknown” 
poison. By experiments on animals, confidence limits are found for the 

* Fisher (1.2) has developed statistical inference in terms of fiducial limits. The 
two concepts, fiducial and conhdeTice limits, have different logical backgrounds, al- 
though in all simple applications to controlled experiments the actual values of the 
fiducial and confidence limits are identical. For a discussion, see Kendall (LI). 
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amount of the unknown that has the same toxicity as a given amount 
of the standard. Since the persons originating the experiment know the 
true value for this amount, they can verify whether the statement that 
the true amount lies within the limits is correct. Ihe practical difficulty 
with this tj^pe of check is the labor required. A large numbei of cxpeii”- 
ments would be needed to verify whether about 95% of the statements 
made with confidence probability .95 were in fact correct, and about 
5% were wrong. 

As we have seen, the statistical solution to the problem of estima- 
tion consists of a statement that the true difference lies betw^een certain 
limits, plus a probability that the statement is correct. It is of interest 
to consider whether this type of information is sufficiently precise to 
permit decisions of practical importance to be made. Although a 
thorough discussion of this question would be rather lengthy, inferences 
of this type often permit definite action to be taken with confidence 
that the action will be fruitful When they fail to decide the point at 
issue, the reason is nearly always that the data obtained are insufficient. 
For illustration, suppose that it is desired to discover whether the ap- 
plication of a dressing of some fertilizer to a crop will be profitable. The 
cost of the fertilizer is such that its application wfill be profitable only if 
it increases yields on the average by 2 or more bushels. A series of ex- 
periments is carried out in order to estimate the true average response 
to the fertilizer. If the 95% confidence limits for the increase due to the 
fertilizer are 4 and 11 bushels, its use can be adopted with a good dc^al 
of assurance that it will be profitable. Similarly, if the 95% confidenco 
limits are —5 and 1 bushels, a decision not to use the fertilizer follows. 
A case where there is uncertainty occurs when the limits are 0 and 5 
bushels. Here it is likely that there will be either a small gain or a small 
loss, but no recommendation can be made without considerable risk of 
its being wrong. If it is important to make the correct decision, further 
experiments must be conducted in order to narrow the distance between 
the confidence limits. 

Thus far we have considered the problem of estimating the true dif- 
ference between the effects of two treatments. In testing hypotheses, 
we are interested in the supposition that the true difference lias some 
specified value, most commonly zero. As in the case of estimation, diffi- 
culty arises because of the variability that is typical of experimental 
data. As a result of this variability, the data are never exactly in agree- 
ment with the hypothesis, and the problem is to decide whether the 
discropany between the data and the hypothesis is to be ascribed to 
these variations or to the fact that the hypothesis is not true. The con- 
tribution of statistics is the operation known as a test of significance. 


1.12 STATISTICAL INFERENCE 5 

Essentially, this is a rule for deciding, from examination of the data, 
whether or not to reject the h^j'pothesis. Such rules are made to satisfy 
two conditions that are obviously desirable: (i) hypotheses that are true 
shall be rejected only very occasionally, and the probability of rejection 
may be chosen by the experimenter; (ii) hypotheses that are false shall 
be rejected as often as possible. 

This technique enables the experimenter to test his hypotheses about 
the action of the treatments, mth the assurance that there is little risk 
of erroneously rejecting a hypothesis that happens to be correct. Prob- 
abilities of .05 and .01 are most commonly used for this risk, and in 
these cases the tests are said to be made at the 5 and 1% significance 
levels respectively. These levels are just useful conventions, and a lower 
probability may be used if the consequences of an erroneous rejection 
of the hypothesis are veiy serious. It should be remembered, however, 
that in lowering this probability value we automatically diminish the 
chance of rejecting a hypothesis that is false. 

A useful property of a test of significance is that it exerts a sobering 
influence on the type of experimenter who jumps to conclusions on 
scanty data, and who might otherwise try to make everyone excited 
about some sensational treatment effect that can well be ascribed to the 
ordinary variation in his experiment. On the whole, however, tests of 
significance are less frequently useful in experimental work than con- 
fidence limits. In many experiments it seems obvious that the dif- 
ferent treatments must have produced some difference, however small, 
in effect. Thus the hypothesis that there is no difference is unrealistic: 
the real problem is to obtain estimates of the sizes of the differences. 

The construction of confidence limits may add something to the in- 
terpretation of a test of significance. For instance, suppose that the 
speeds of the 2 machines in. the example had not been significantly dif- 
ferent. It is a commonplace that this result would not prove that the 
2 machines were identical in speed. However, if the 95% confidence 
limits for the difference in speed were —2 and +4 seconds, we might 
argue that a true difference of 4 seconds, even if it existed, would be of 
no practical significance. Consequently, it could be said that for all 
practical purposes the 2 machines are identical in speed. This is much 
more positive and useful than the mere statement that the difference in 
speeds was not statistically significant. If, on the other hand, the con- 
fidence limits were —30 and +32 seconds, there Is no justification for the 
conclusion that the machines can be regarded as equivalent. All that 
we have learned is that the data are not sufficiently accurate to show 
whethel* there is a difference in speed that is of practical importance. 

To summarize: variability in results is typical in many branches of 
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experimentation. Because of this, the problem of dra'wing conclusions 
from the results is a problem in induction from the sample to the popula- 
tion. The statistical theories of estimation and of testing hypotheses 
provide solutions to this problem in the form of definite statements that 
have a known and controllable probability of being collect. Ihese 
statements are specific enough to be useful in deciding whether action 
can bo taken on the basis of the results. 

1.13 The Function of Randomization. As would be expected, the type 
of statistical inference that can be made from a body of data depends on 
the nature of the data. It is easy to conduct an experiment in such a 
way that no useful inferences can be made, and many of the experiments 
brought to the statistician, particularly in earlier years, have been of this 
type. To take a simple example, suppose that in the comparison of the 
calculating machines each sum of squares had been computed first on 
machine A and then on machine B. Now it is quite possible that in- 
creased familiarity with the data will enable the second computation to 
be done faster than the first. The advantage is unlikely to be great in 
such an easy calculation; it could be so if the computation were more 
difficult. (In the experiment this advantage was estimated at about 4 
seconds, though the confidence limits for the true advantage are rather 
far apart.) 

If the e.xperiment is conducted in this way, the observed difference in 
speed (B — A) is an estimate of the true difference, plus the unknown 
diflerenee in speed between a second calculation and a first. Confidence 
limits set up by statistical techniques apply not to the true difference 
between B and A but to the true difference plus this unknomi advantage. 
Consequently, the limits tell us nothing definite about the tine difference 
between B and A. The interpretation of tests of significance also be- 
comes confused. If A is found significantly faster than B by ordinary 
statistical tests, we can be confident that there is a real difference in 
favor of A, since A was handicapped in the course of the e.xperiment. 
But if B is found significantly faster than A, we do not know what to 
conclude. In this case we are dealing with a bias whose nature can be 
anticipated before the experiment has started. In other e.xperiments 
where less is known about the type of variability that is present, similar 
biases that are quite uneiqiected can occur from some apparently in- 
nocuous rule about the way in which different treatments are handled. 

. In order to avoid these biases we need some means of insuring that a 
will not be continually favored or handicapped in successive 
iS by some extraneous source of variation, known or unknown. / 
le by the device Imown as randomization, due to Fisher. In- 
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stead of performing every calculation first on machine A, we apply the 
principle of randomization tossing a coin to determine whether or 
B shall be used first in any trial. The decision is made independently 
in each trial. The effect is that in any t rial each machi ne has 
chance of bein g test^ und er the more fav o rab le conditions. Of course, 
the result of any specific randomization may favor one or the other 
treatment. But this happens only to an extent that is allowed for in the 
calculations that ai'e used for tests of significance and confidence limits. 
This important result has been illustrated in detail by Fisher (1.2), who 
has sho™ how tests of significance and confidence limits can be con- 
structed, using only the fact that randomization has been properly ap- 
plied in the experiment. Randomization is one of the few character- 
istics of modern experimental design that appears to be really modern. 
One can find experiments made 100 or 150 years ago that embody the 
principles that are now regarded as sound, with the conspicuous ex- 
ception of randomization. 

The occasions on which randomization is required vary mth the type 
of experiment and must be left to the judgment of the experimenter. 
One occasion arises when the treatments are allotted to the experimental 
material. Suppose that the effects of different diets on the heights and 
weights of children are to be ascertained. Since different children grow 
at different rates, a treatment that happens to be assigned to a group of 
fast-growing youngsters is favored. Consequently, we allot the diets in 
any replication at random to the children who are to receive them, with 
a new random allotment in each replication. Similarly, if 4 different 
oven temperatures for the cooking of roasts are under comparison, the 
4 temperatures are assigned at random to the 4 roasts "which form the 
material for any replication. Sometimes this is the only randomization 
required, but frequently other operations that are carried out in the 
course of the experiment are also potential sources of bias. With a 
repetitious operation the order of events may be important, either be- 
cause a learning process is involved which tends to make later operations 
better than the earlier ones, or because fatigue tends in the opposite 
direction. S^^stematic biases may be guarded against by randomizing 
the order in which the operation is performed on the different treatments 
in a replication. In other cases the equipment that is used introduces 
variation. For example, if 4 ovens are available to cook the 4 roasts, 
we would not always use the same oven for the same temperature, in 
case biases should be introduced because of systematic differences among 
the ovens. Instead, the temperatures could be assigned at random to 
, the ovens in each replication. Thus we have tm randomizations, one 
to assign the temperatures to the roasts and one to assign them to the 
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ovens. If, however, we decide, before randomizing, which roast is to go 
in which oven, the two randomizations can be reduced to one. liiib 
method can ahvays be used, if convenient, to cut down the niimbei of 
randomizations that must be made. 

Randomization is somewhat analogous to insurance, in that it is a 
precaution against disturbances that may or may not occur and that 
may or may not be serious if they do occur. It is generally advisable to 
take the trouble to randomize even when it is not expected that there 
will he any serious bias from failure to randomize. The experimenter is 
thus protected against unusual events that upset his expectations. Of 
course in experiments where a great number of physical operations are 
involved, the application of randomization to every operation becomes 
time consuming, and the experimenter may use his judgment in omitting 
randomization where there is real knowledge that the results will not be 
vitiated, It should be realized, however, that failure to randomize at 
any stage may introduce bias unless either the variation introduced in 
that stage is negligible, or the experiment effectively randomizes itseli. 

1.14 Restricted Randomization. We now reconsider the randomiza- 
tion proposed for tlnv experiment on computing machines, in order to 
discus>s a criticism that presumably has occurred to the reader. When 
this experiment is being planned, ive possess the knowledge tlmt there is 
some advantage to the machine on which the second calciilaiion of any 
sum of squares is made. In the light of this knowledge, it seems wise to 
make sure that each machine receives the advantage in five of the 10 
replications, rather than to leave this to be decided by the tossing of 
coins. In fact, if the decision is made by tossing coins, the chances are 
only a})out 1 in 4 that each machine will be used first in five replications; 
they are also about 1 in 4 that one machine will be used first in 7 replica- 
tions. The criticism is that the randomization is unlikely to give as ac- 
curate a comparison as the proposed alternative. The criticism is sound, 
and the experiment was actually planned so that each machine would 
be used first in half the replications. 

Randomization was also applied in the design actually used, though it 
is a different type from that originally proposed, being subject to' the 
restriction that each machine must appear first exactly five times. Iluj 
process of randomizing therefore consisted in choosing five numbers at 
random from the numbers 1 to 10; these turned out to be 1, 3, 6, 8, and 0. 
Accordingly, machine A was used first in replications 1, 3, 6, 8, and 9. 
This is one of the cases where it is difficult to see why failure to random- 
ize could have led to any serious danger of bias. It probably would have 
ton quite satisfactory to use machine A first in the first 6 replicates, and 
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machine S first in the remainder. Nevertheless, it happen that 
the advantage in the second calculation of a sum of squares would 
diminish in later replications; and, to save the trouble of trying to guess 
whether biases of this type are likely, the randomization was used as a 
precaution. 

The example represents a fairly common situation, in which knowdedge 
of the variation that will affect the results is suiSciently detailed so that 
the simplest type of randomization is objectionable on the grounds of 
accuracy. Most of the designs in this book were constructed for this 
situation. By suitable restrictions on the randomization, these designs 
enable the experimenter to utilize any knowledge that wdll increase the 
precision of the experiment. In addition, each design allows sufllcient 
randomization to prevent biases from soiu’ces of variation about which 
knowledge is less certain. For each of the principal designs described, 
the appropriate method of randomization is also presented. 

One further point should be mentioned. We have seen that the ex- 
periment on computing speeds could have been carried out either by 
means of the simple randomization originally proposed or by means of 
the restricted randomization which ensures that each machine is used 
first five times. Both methods provide a valid test of significance and 
valid confidence limits. However, the calculations^ ^m for the test of 
significance and for the eonfidence limits are different in the 2 cases. 
The first experiment is of the type known as randomized blocks, and 
calculations are made as described in section 4.23; the second experi- 
ment is of the ^Tross-over^^ type, for which the calculations are given in 
section 4.42. This is an instance of the general rule that the ^vay in 
which the experiment is conducted determines not only whether in- 
ferences can be made, but also the calculations required to make them. 
The experimenter must make sure that the calculations which he uses 
are appropriate for the experiment. 

1.2 Initial Steps in the Planning of Experiments 

1.21 Importance of the Initial Steps, It has been mentioned that 
statisticians are often asked for advice in making inferences from the 
results of experiments. Since the inferences that can be made depend on 
the Avay in which the experiment was carried out, the statistician should 
recpiest a detailed description of the experiment and its objectives. It may 
then become evident that no inferences can be made or that those which 
can be made do not answer the questions to which the experimenter had 
hoped to find answers. In these unhappy circumstances, about all that 
can be done is to indicate, if possible, how to avoid this outcome in future 
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fexperimcnts. Consequently, it has come to be realized that the tim e to 

t])in k, about statistical inference, or to seek adv ice, is when the expen- 

t ent is beinp: planned . 

Participation in the initial stages of experiments in difierent areas of 
research leads to a strong conviction that too little time and effort is put 
into the ].)lanmng of experiments. The statistician who expects that his 
contribution to the planning will involve some technical matter in sta- 
tistical theory finds repeatedly that he makes a nauch more valuable 
contribution sirn])ly by getting the investigator to explain clearly why 
hi) is doing the experiment, to justify the experimental treatments whose 
effects he pro|)oses to compare, and to defend his claim that the com- 
pleted experiment will enable its objectives to be realized. For this 
reason the remainder of this chapter is devoted to some elementary 
comments on the subject of planning. These comments are oflered with 
diffidence, because they concern questions on which the statistician has, 
or should have, no special authority, and because some of the advice is 
so trite that it would be unnecessary if it were not so often overlooked. 

It is a good practice to make a written draft of the proposals for any 
experiment. This draft will in general have three parts: (i) a statement 
of the objectives; (ii) a description of the experiment, covering such 
matters as the experimental treatments, the size of the experiment, and 
the experimental material; and (iii) an outline of the method of analysis 
of the results. 

1.22 The Statement of Objectives. This statement may be in the 
form of the questions to be answered, the hypotheses to be tested, or the 
effects to he estimated. The aim should be to make the statement lucid 
and specific. The most common faults are vagueness and excessive 
ambition, in the sense that a twenty-year research program ivould be 
required in order to realize the stated objectives. Often it is advisable 
to classify the objectives as major and minor, because some types of 
experiment give high precision for certain treatment comparisons Init 
low precision for others. When the experiment represents a eoopera-tion 
among people of different interests, this classification is particularly 
helpful in that it makes clear which objectives take priority and helps 
to avoid an unhappy compromise that is adopted in the hope of pleasing 
everybody. 

The statement should include an account of the area over which gen- 
eralizations are to be made, or in other words, of the population about 
which it is hoped to make inferences. If an experiment is to be con- 
ducted on persons suffering from some disease, are the results presumed 
to apply to the patients in a specific hospital, to patients in all hospitals, 
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or to allmfferers whether in hospital or not? This sort of question is 
crucial in applied research, where often the experimenter has some 
definite population in mind to which he would like to apply the results. 
It is obvious that worth-while inferences about an extensive population 
cannot be made from a single experiment. For instance, inferences made 
from the experiment on the two computing machines are restricted to 
the person who made the calculations and to the type of calculation 
made. There is no guarantee that the results would be the same for a 
different type of calculation or vdth different operators. Consequently, 
if the object were to find out which machine is the speedier in general 
use in a computing laboratory, the experiment that was made has only 
scratched the surface of the problem. 

1.23 The Experimental Treatments. We have used the term treat-- 
mentB to denote the different procedures whos e effec t s are to be measured 
and compared^ In the selection of treatments it is important to define 
cleaflyneach treatment and to understand the role that each treatment 
will play in reaching the objectives of the experiment. Some issues that 
arise in particular cases are as follows. 

1. Confusion is sometimes caused by failure to distinguish whether 
the cfiqeet is merely to ^^spot the winner” among the different treatments, 
or whether in addition it is desired to find some clues as to why the 
treatments behave as they do. A good example is the experiment which 
demonstrates, or so we are informed, that each of the three treatmentvS, 
whiskey and water, gin and water, and rum and water, taken orally in 
sufficient quantities, produces some degree of intoxication. By itself the 
experiment provides no information as to whether the intoxication is due 
to the water, to the other ingredient, or to the fact that the two are 
mixed. A more extensive experiment with additional treatments would 
be necessary to throw some light on this question. Although there are 
occasions when it is sufficient simply to discover which is the best of the 
treatments, experience suggests that even in strictly applied research 
progress is faster if experiments supply fundamental knowledge. Sim- 
ilarly, the criticism that a certain treatment should be omitted because 
it Avould not be used in practice is valid if the aim is to find the best of 
the ‘‘practical” treatments, but it is not valid if the “impractical” 
treatment will supply some needed information about the behavior of 
other treatments in the experiment. 

2. The specification of the treatments may raise difficult questions 
about the conditions under which the treatments are to be compared. 
Suppose that the object is to find out the effect of an application of 
sulphate of ammonia on the yield of some crop. It is well loiowm that 
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this offect is likely to depend on the amounts of other important plant 
nutrients that are available to the crop. Consequently, it must be 
decided whether to add these nutrients so that they will be m abundant 
supply on all plots, or to test the nitrogen on the soil as it happens to be. 
The decision must be guided bj’' the objectives of the expermient, and 
more particularly by the type of population to which inferences are to 
jipply. Sometimes it is advisable to test the nitrogen in both the pies- 
enco and the absence of the other nutrients. One result of this dedsion 
will be that the experiment becomes also a test of the other nutrients, 
even though this may not have been part of the original plan. E.\-peri- 
monts (»f this type, where one factor is tested over different levels of 
another factor, are known a& factorial experiments and have come to play 
a prominent part in experimentation. They are discussed more fully in 
chapters. 

3. In some cases it becomes apparent that the treatments that can 
be tested in practice are not those that we would like to test. The follow- 
ing example is typical of many fields where human relations are involved. 
Suppose that it is desired to compare two methods ri and B of teaching a 
foreign language, and that we have defined each method dearly and 
have decided how to measure the success achieved by oadi method. 
However, some of the teachers who are to participate in the test already 
use a method similar to method A and have strong beliefs that method 
B is inferior, whereas others already follow method B and hax'e no i!S(> 
for method ri, If the teachers are divided at random into the groups 
that are to teach the two methods, we may expect that some teachers 
will be assigned to teach a method in which they have no faith, and we 
may deckle that this is not the kind of comparison that we had in mind. 
On the other hand, if we forsake randomization and allow teachers to 
use the method that they like, differences attributed to the methods may 
in fact be due to differences in the ability of the two groups of teachers. 
Given enough resources, the situation can be more thoroughly explored 
by having each method taught by those who like it, those who do not, 
and those who are neutral, so that there are six treatments instead of 
two. In practice, the resources may not permit this elaboration, and 
the issues that must be faced are how best to use the resources available, 
and^ whether the experiment, as it can be done, is worth doing. Here 
again, the deciding factor is usually the use to which the results are to be 

4. Discussion may arise as to the need for a “control,” this tern being 
used rather loosely for a treatment in which we are not particularly in- 
-terested, but which may be needed in order to reveal, by comparison 
whether other treatments are effective. Suppose that we wish to com- 
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pare the effectiveness of three treatments that are qualitatively similar, 
lor example, three nitrogenous fertilkers all of which supply the same 
ajncmnt of nitrogen. The control would be a “lio-nitrogen” treatment. 
Three cases may be distinguished, (i) The effectiveness of this type of 
treatment has been consistently demonstrated previous^, and it re- 
mains to discover which of the three qualities is best. There is no need 
for a control, (ii) The tyx)e of treatment is in general effective, but oc- 
casionally the conditions of the test are such that it is not. For ex- 
ample, nitrogenous fertiliziers may fail to produce responses in fields 
where the soil fertility is very high. In this case it will be well to add a 
control, which serves primarity to describe the conditions under which 
the fertilizers ^vere tested, (iii) It may not be known whether the type 
of treatment is effective. The control should of course be included, and 
there may be a case for giving it more replication than the other treat- 
ments receive. An experiment of this type is analyzed in chapter 3, 
where the treatments were 4 types of soil fumigant, and the control (no 
fumigation) was replicated 16 times as against 4 for the other treatments. 
The effect is to obtain a more accurate estimate of ihe mmmge response 
to fumigants, at the expense of some loss in accuracy in the comparisons 
among the fimiigants. 

An interesting example of the illumination that is sometimes produced 
by tJie inclusion of a control has been reported by Jellinek (1,3). A 
headacdiiv remedy contained 3 ingredients, a, h, and c. In order to test 
wlietiier ingredients 5 and c were necessary, the complete mixture (abc) 
was compared with (oc) and (ah). There were 199 subjects, each of 
whom was treated with each dimg for a two-week period, the appropriate 
drug being given whenever the subject complained of a headache. Suc- 
cess was measured by the ratio of the number of headaches relieved to 
the number tested in the two-week period. The mean success rates 'were 
0,84 for (ahe), 0.80 for (ac), and 0.80 for (ah); these figures show very 
little overall difference in effectiveness. 

The experiment also contained a ^^controF —an inert mixture which 
looked the same as the others but had no active ingredients. This was 
tested under tlie same conditions as the 3 drugs. No less than 120 sub- 
jects, or about 60%, jreported at least some of their headaches relieved 
by the control, a result that is itself of some interest. Further, the 
control enables us to isolate the group of 79 subjects whose headaches 
were not relieved by an inert, mixture. For this group, the mean success 
rates were 0.88 for (ahe), 0.67 for (ac), and 0.77 for (ah), and could be 
shown to be significantly different. As the author comments: ^^Banal as 
it may sound, discrimination among remedies for pain can be made only 
by subjects who have a pain on which the analgesic action can be tested.^^ 
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If a control is required, it must be an integral part of the expeiiiiierit 
so that results for the control are directly comparable with those for the 
other treatments. This point tends to be overlooked in expeiimemts 
with human subjects when it is difficult or troublesome to assemble the 
desired number of subjects. For example, if a new drug is to be tested 
in some ward of a hospital, the recovery rate in the ward before the drug 
was introduced is not a satisfactory control, nor is the recovery rate in a 
different ward where patients happen to be receiving the standard drug. 
An observed difference between the effects of the new and the standard 
drug might be due to differences in the severity of the disease or in the 
type of patient or in other aspects of the medical care in the two time- 
periods or the two wards. It is necessary to regard the new and the 
standard drug as two experimental treatments on an equal footing, and 
to use randomization, or one of the methods of restricted randomization, 
in assigning the two drugs to patients. 

1.24 Further Details. Other features of the experiment that should 
be included in the draft of the proposals are the niimber of re|.)lications, 
the types of experimental material to be used, and the rtieasiirernents 
that are to be made. Since these primarily affect the accuracy of the 
experiment, discussion of them is deferred to chapter 2, which is devoted 
to the general question of accuracy. 

Finally, the draft should describe in some detail tlie proposed niethod 
for drawing conclusions from the results, This is the section that is 
most frequently omitted, though it is a valuable one. It inay iriclude a 
sketch of the analysis of variance, an indication of the tal>u!ar form in 
which results will be sho^vn, and some account of the tests of sigriificance 
to be made and the treatment differences that are to be estimated. In 
this process we verify which treatment eomparisons are relevant to 
each of the stated objectives of the experiment. Attention is drawn to 
deficiencies in the set of treatments and to treatments that supply no 
information essential to the purpose of the experiment. 
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CHAPTER 2 


METHODS FOR INCREASING THE ACCURACY OF 
EXPERIMENTS 


^ 2.1 Introductioii 

The result s of experiments are affected n ot only by the action of the 
but a lso by extraneous variations which tend to masV the 
effects of the treatments. The term ex venmental error s is often applied 
these variations, where the word errors is not synonymous with 
V']%^'^‘bnistakes,” but includes all types of extraneous variation. Two main 
sources of experimental errors may be distinguished. The first is in- 
herent variability in the experimental material to which the treatments 
are applied. We shall use the term eicpermentol imfi to denote the group 
of material to which a treatment is applied in a single trial of the ex- 
periment. Theunit may be a plot ofland, a patient in a hospital, or a 
lump of dough, or it may be a group of pigs hi a pen, or a batch of seed. 
It is characteristic of such units that they produce different results even 

when subjected to the same treatment: these differences, whether large 

or small, contribute to the experimental errors. The second source of 
variability is lack of uniformity in the physical conduct of the experi- 
ment, or in other wmrds, failure to standardize the experimental tech- 
nique. 

Neither the presence of experimental errors nor their causes need 
concern the investigator, provided that his results are sufficiently ac- 
curate to permit definite conclusions to be reached. In many fields of 
research, however, with the time and labor that can be given to.an exr 
periment, the results are so greatly influenced by experimental errors 
that only treatment differences that are large can be detected, and even 
these may be subject to considerable uncertainty. As a consequence, 
the investigation of methods for increasing the accuracy of experiments 
has played a prominent part in experimental research in recent years. 
j The methods may be classified broadly into three types. The first is 
I to increase the size of the experiment, either through the provision of 
I more replicates or by the inclusion of additional treatments. The second 

I is to refine the e.xperimental technique. Thirdly, we may handle the 
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experimental material so that tlie effects of friability are reduced. 
This may be done b^^ careful selection of the material, b^r taking ad- 
ditional measurements that provide information about the material, or 
finally by skillful grouping of the experimental units in such a way that 
the units to which one treatment is applied are closel}^ comparable with 
those to which another treatment is applied. These various methods are 
discussed in more detail in succeeding sections. 

At this point it is advisable to discuss briefly the concepts accuracy and 
precision. Although these words are regarded as roughly synonymous 
in most dictionaries, they have sometimes been assigned technical mean- 
ings, particularly in the natural sciences and engineering. The difference 
between the concepts may be illustrated by an experiment in which 
children are weighed on a machine that ha>s a bias of 1 lb. That is, if the 
true w^eight of a child is 46 lb,, repeated weighings of the child give read- 
ings which vary around 47 lb. The accw'aey of a measurement of the 
weight of the child signifies the closeness with which the measurement 
approaches the true weight, 46 lb. The terra precismi^ on tlie other 
hand, is concerned merely with the repeatability of the measurements. 
Thus the precision of a raeasurounent denotes the closeness with which 
the measurement approaches the aA^^erage, 47 lb., of a long stnies of 
measurements made under similar conditions. It is clear that, if the 
bias is large, a measurement be of high precision but of low act- 
curacy.*^ 

Most of the devices mentioned in this chapter, including re]>li(*u.tion, 
additional measurements, and skillful grouping of tlie expcrinKUital mtt- 
terial, operate on the precision of the experiment. In m far as tlie 
method of measurement remains biased, these procedures do not affecd 
this bias. A refinement in technique may reduce it. Oilier things being 
equal, an increase in precision is accompanied by an increase m ac- 
curacy, though if a large bias is present a substantial increase in pre- 
cision may result in a trivial increase in accuracy. 

For simplicity in presentation, Are shall sometimes speak as if no bias 
is present, that is, as if precision and aceura(\y are identi(*al. Thus the 
phrase “true difference between two treatments’^ will mean the true 
difference as recorded by the measuring dovk‘6 that was actually useiL 
This will be identical with the natiirai conctqit of a *ff rue diffVieiua*’’ 
only if the measurements are not biased in favor of one of I he treatments. 

* For a discussion of the difHcudt jiroblcin of measuring precision .and a(*curacy, 
see Shewhart (2.8). 
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2»2 Number of Replications 

Whatever the source of the experimental errors, replication of the 
experiment steadily decreases the error associated with the difference 
between the average lesults for two treatments, piwided that pre- 
cautions (such as randomization) have been taken to ensure that one 
treatment is no more likely to be favored in any replicate than another, 
so that the errors affecting any treatment tend to cancel out as the 
number of replications is increased. The rate at which the experim.eiital 
error is reduced is predictable from statistical theory. The basic qu an- 
tity used to measure experimenta l eiTor^isJlie_grro?yyam^^ 
permmtal umt, which is defined as the exp ected value of the squ are 
of the eiTor that affects the observation for a single experimental im it. 
The square root of this quantity is called the stcmdard error per urtiL If 
or^ is the error variance per unit and there are r replications, the error 
variance of the difference between the. means for two treatments is 
2cr^/r, and the corresponding standard error is A/2o'^/Vr, This result 
is valid unless increased replication necessitates the use of less homo- 
geneous experimental material or of a less careful technique. 

In succeeding sections some advice is given on the (choice of the number 
of replications for an experiment. To present realistic advice is not 
easy. Often the size of an experiment is limited by lack of resoiu’ces 
or by the conflicting claims of other experiments. However, even if it is 
realized that an experiment must fall short of the precision desired, it is a 
good practice to try to estimate the degree of precision that will be at- 
tained and to present this information as part of the proposal for the 
experiment. 

2.21 Number of Replications for Tests of Significance. Consider the 
difference between the effects of a pair of treatments, which might be a 
standard procedure and some new procedure which it is hoped will prove 
better than the standard. The precision desired in an experiment may 
be specified in either of two waj^s. We may specify the size of the true 
difference which the experiment is to detect by means of a test of sig- 
nificance, or we may specify how closely we wish to estimate the true dif- 
ference, by stating the width of the confidence interval that we would 
like to have for the true difference. 

Although the specification used is to some extent a matter of individual 
taste, the approach by means of a test of significance is helpful mainly 
in the initial stages of a line of research. The reasoning in this approach 
is roughly as follows. If it can be established that the new procedure is 
superior to the standard procedure by at least some stated amount^ say 
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20% of their mean, then discovered a tiseful i*esiilt. On 

the other hand, if the first experiment shows no significant difference 
between the two treatments, we will be discouraged from further in- 
vestigation of the new treatment. Consequently, the first experinienf. 
must be large enough to ensure that if the tme difference is 20% or more, 
it is highly probable that we will obtain a significant diliereiice. Note 
that in this approach we do not insist that the first experiment give a 
precise estimate of the true difference, but merely that it give a signifi- 
cant result at the chosen level of significance. 

The probability of obtaining a significant result depends on the true 
standard error a per unit, the number of replications r, and the iniinber 
of degrees of freedom n that the experiment provides for estimating tlie 
error variance. The exact calculation of the probability is rather com- 
plicated; tables have been given by Neyman et oL (2.1) and otliers. The 
argument used in the following example, though logically faulty, leads 
to an approximation that is good enougli for most prac^^'^**^^ 

Example.. The true standard error per unit is 12% of the nu in the 
experiment, and there are 8 replicates and 28 d.f. for thi c timiU o( 
error variance. What is the probability of obtaining a sigiuficain result 
when the true difference is 20%? 

Let d be the observed difference and 8 the true dilfcrc^nee bet-weim th(^ 
mean of the new treatment and that of the standard treat meni. 
true standard error of d is 

^ ^ b% 

111 practice we do not laiow o', and in its place we use ilu^ <\sriniatc»d 
standard error per unit obtained from the analysis of variance of the 
results. We will assume that s happens to be equal to cr. 

The test of significance of d is made by means of a Student /-test, wit h 
28 d.f. To find the value that d must attain in order to be significant, \ve 
multiply the standard error of d, 6%, by the sigiiifii'ant value of /. 
Suppose that the test is made at the 5% significance lovrL and 1hat> it is 
a one-tailed test.* The 10% value of / in the standard tabius. for 

A onotailed test is used when it Is known that fhc new treat incnit must he at 
least as good as the standard procedure. The diffenauH^ d U di‘claivd 
only if d is sufficiently large and ponitwe; if d is negatiw we aserihe n\sult, t<} ex- 
perimental errors, since we do not eiitertani the possilnlity that, tla* new ti'ealiinmt 
could be inferior. A two-tailed test is made when we lio not know whiefj treatment 
is better; large values of d, whether positive or m^gative, are declared slgnifit^aid. 
All standard tables of the significance levels of / art^ computed for two-taikd t<‘s!s. 
For a one-tailed test wo read the value of / corresponding to twice ilie significance 
probability. 
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28 d.f . is 1.701, so that if d is to be significant Ave must have 


d > ^~ sti = (6) (1.701) = 10.206 

The problem is to find the probability that this happens. 

The quantity 

d-d d-20 
'2 ~ 6 

follows the ^-distribution with 28 di. But it is evident that 

^ , d- 20 10.206 - 20 

d > 10.206 if and onl}^ if (2 = — ^7— > 7 = —1.632 


6 


6 


Hence the probability which we seek is the probability that a value of 
/.j with 28 d.f.j should exceed —1.632. In the table of t, the probability 
that a value lies outside the limits ±1.632 is about .11. The proba- 
bility wanted is therefore [1 — (h2)(0.11)], or about .945. It is highly 
likely that a significant difference would be found in this experiment. 

The flaw in the argument is the assumption that the value of s can 
be assumc'd equal to cr, whereas s is actually a random variable. If the 
argument is carried through algebraically, it gives the following rule for 
calculating the approximate probability P of obtaining a significant re- 
sult. 

Step L Find ^2 from the relation 


-cr(ti + ^ 2 ) 

fr 


( 2 . 1 ) 


where 5 — true difference that it is desired to detect. 

(T = true standard error per unit, 
r = number of replications. 

ti — significant value of t in the test of vsignificance (with care to 
distinguish between one-tailed and two-tailed tests). 

Step 2. 

P - 1 - (H)P2 (2.2) 

where 2>2 is the probability corresponding to t 2 in the ordinary stable. 
The degrees of freedom in ti and are those available for the estimate of 
error variance. , , 
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TABLE 2.1 Numbee of bbplications hkqtjIRBP for a given pbobamuty op 

OBTAINING A SIGNIFICANT KBSDLT 

Upper figuroi Test of significaiiee at level, probability 
Middle figure: Test of significance at ,5% level, probability 90 Vo 
Lower figure: Test of significance at 1% level, probability Oo/i, 
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The nile may be inverted in order to jxive the nninbor of rt'})li(‘iiiioii,s 
?• required for a. given proiiabilily P of obtaining a .signifieanl result. In 
this form we have 


r > 2 


(h + 


where S = true difference that it is desired to detect. 

<T = true standard error per tinit. 

h ~ significant value of t in the test of signifieanee. 

tz - value of t in the ordinary table corresponding to 2( 1 — P). 


2.21 NUMBEE OF REPLIGATIONB FOE, TESTB OF SIGNIFICANCE 21 

TABLE 2.1 (Confmued) 

Two-tailed tests 
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Notes, In constructing the table, it was assumed that the number of degrees of 
freedom for error is 3(?* — 1); this would apply in a randomized blocks exporimont 
with 4 treatments. 

No entries are given where more tlnin 50 replications are required. 


The use of this equation is sliglitly tricky, because tlie number of 
degrees of freedom in h and t 2 depends on r. Trial and error may be used 
until the smallest r satisfying (2.3) is found. 

Exmnple, In the previous example, how many replications are re- 
quired ill order to have a chance of 4 in 5 that a significant result will be 
obtained? Here we have 5 — 20, <r = 12, and we will suppose that the 
experiment has 4 treatments in randomized blocks, which gives n = 
3(r — 1) degrees of freedom for the estimate of error. In the /.-table, 
is the value for probability .10, while is the value for probability 
2(1 -- .80) or .40. 



w 
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In order to stot the some guess must be made about the 

value of n» It does not matter greatly if the guess is inaccurate; we 
try 71 - 30. For this, 

h ^ 1.697; t 2 = .854; so that (h + fe) — 2.551 

Hence 

r > 2 (i>^o)^(2.551)" - 4.7 

Consequently our second trial value is r == 5. Since this provides only 
12 instead of 30 di., we must repeat the calculation to verify whether 
5 replications are suflacient. For 12 di. (h + 1^2) is 2.655. Thus 



2 + ^2)" = 2 (2.655)= = 5.08 

Since this Is greater than 5, we conclude that 5 replications arc not cpiite 
enough, and that 6 is the smallest number of replications that will 
satisfy the conditions. We can be sure without further computation 
that 6 will satisfy the conditions, because the right-hand side of the in- 
equality always decreases when we increase r. 

Table 2.1, computed by this method, shows the minimum number of 
replications for a range of values of d and 0*, and is intended to be used 
in the planning of experiments. Usually it is most coin^eiuent to tJiink of 
8 and <r expressed in percentages of the mean per observation for tlu^ 
experiment, and the range in the table has been constructed witJi this 
in mind. Since the value of r depends only on the ratio <r/dj it dcjes m.d 
matter wdiat imits are used for 8 and cr, pro^dded that they are the same 
for both. There are three entries for any given pair of values of 8 and <r. 
These show, in descending order, the replications required for a test of 
significance at the 5% level and a probability .80 of getting a significant 
result, for a 5% level and probability .90, and for a 1% level and prob- 
ability .95. The top half of the table applies to one-tailed tests, the 
lower half to two-tailed tests. 

Construction of the table requires some assum])tion about the mza of 
the experiment. In the case presented the experiment has four trc^at.- 
ments in randomized blocks, so that n = 3(r - 1). Tiie va.lu(‘S in the 
table change little if the experiment happens to provide m<.)re iluin tliis. 
If the experiment has only two or three treatments it is advisalde to 
check the value of r by means of the formula. 

In the practical use of the table it is necessary to estimate the value of 
the true standard error per unit. Often this can be done fairly well from 
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the results of previous experiments with the same kind of material. It 
should be realized that the probabilities to which the table applies will 
be correct only if the value of cr is estimated correctly. If o- is under- 
estimated, the probability of obtaining a significant result will of course 
be smaller than the presumed value. For cases -where it is highly im- 
portant that the probability be correctj more exact metliods are dis- 
cussed in section 2.23. 

As an illustration of the reading of the table, consider the case in the 
hist example, where 5 == 20, o* = 12, P = .80, and the test is a one-tailed 
test at the 5% level. For this we read the top entry in the top half of 
the table for d = 20, a — 12, and find that 6 I’eplications are necessary. 
From the middle figure we see that an increase to 7 replications in- 
creases the probability to .90. If the test is to be two-tailed, we read at 
the corresponding place in the lower half of the table, and find that 7 
replications are required for a probability .80 and 9 for a probability ,90. 

The table brings out the value of any reduction in the standard error 
per unit. In fact, w'e cannot have a high probability of detecting a 6% 
difference with any reasonable number of replications unless the stand- 
ard error per unit is 4% or under. Differences of 20% or over can be 
detected in most cases without excessive replication. No entries have 
been inserted in the table for situations where more than 50 replications 
would Ije reciiiired, since experiments of this size are uncommon. 

2.22 Number of Replications for Prescribed Limits of Error. The 
confidence limits for d, the true difference between the effects of two 
treatments, are 



where t is the value in the stable corresponding to the confidence prob- 
ability chosen and to the number of degrees of freedom 7i in the estimate 
of error. The quantity may be called the limit of error, since 

it measures the maximum distance between d and 8 for the confidence 
probability chosen. Since the limit of error depends on s, it is not known 
until die experiment is completed. If s is replaced by its average or 
expe<‘,ted value, we obtain the expected limit of error 
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The factor /enters because the average value of s is slightly less than v; 

to a close approximation, / equals ^1 — 

From relation (2.5) table 2.2 was constnicted to show the numbers of 
replications required for various values of L and for conhdence probabil- 

TABLB 2.2 NmiBBa op beplic.wions EEaiuKBD for given limits op error is 

THE mFFEllENCE TVTO TREATMENTS 

Upper figure: Coiifkleiice probability ,80 
Middle figm'e: CWfidence pix)babiH 
Lower figure: Confideiiee probability .95 
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ities of ,80, ,90, and ,95. As in table 2,1, the experiment was assunu^i in 
have SCr — 1) degrees of freedom in the estimate of error; no entries were 
inserted where more than 60 replications are necessary, and the values 
of L and <r are expressed as percentages of the mean for the experiment. 
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This table may be used in the planiiiiig of an experiment where the 
object is to estimate the difference in the effects of two treatments. Sup- 
pose that the true standard error per unit is expected to be about 6% 
and that prelmiinaiy experiments have indicated that the true difference 
between the effects of the two treatments is around 25%, In this ease 
we would probably not be satisfied with a value of L any higher than 
5%; if L is 5% and if the observed difference is found to be 22%, we 
are confident that the true difference lies between (22 ± 5)%, that is, be- 
tween 17 and 27%. From the entry for cr == 6, L = 5, we find that 6 
replications siijEce for a confidence probability of 80%, 9 for 90%, and 
12 for 95%. 

It is important to reiterate that this table does not gtiamniee that the 
limits of error will be of the desired size, even if the value of a- is fore- 
casted correctly. The actual limits will depend on s and may be either 
larger or smaller than L. They will be about the desired size on the 
average if cr is guessed correctly. Further, the table makes it clear that 
small treatment differences of the order of 5% cannot be accurately 
estimated in a single experiment, at least with the values of a and r that 
normally prevail in many lines of research. For an accurate estimate of 
a 5% true difference, we would probably not want L any larger than 1%. 
This value was not included in the table since it demands great numbers 
of replications. 

2.23 The Case Where Additional Assurance Is Desired. We have 
seen that tables 2.1 and 2,2 do not ensure that the desired standards of 
accuracy will always be attained. In table 2.1 the probability of obtain- 
ing a significant result is as stated only if the value of u is correctly 
guessed. In table 2,2, if cr is coixectly guessed, the limits of error will 
equal the desired L on the average, but for a single experiment may be 
either higher or lower. In certain circumstances, methods are available 
which provide a more definite assurance. 

If an initial estimate of cr^, based on ni degrees of freedom, has been 
obtained from preliminary data, Harris, Horvitz, and Mood (2.3) present 
tables that give the number of replications required so as to be certain 
that the probability of obtaining a significant result will have the de- 
sired value. No guesswork about or is involved, though if ni is small the 
number of replications required tends to be large so as to allow for the 
possibility that 8%^ is a very poor estimate of cr^. With the same pre- 
liminary data a value of r may be found such that the probability is 
.05 that the limits of error exceed L. If in default of an estimate Si^ we 
can give a range of values within which is likely to lie (e.g., from pre- 
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vious experiments), the <autliors suggest a simple method for using their 
tables. 

An alternative approach, due to Stein (2.4), is applicable when the ex- 
periment can be interrupted after a given number of replications. This 
approach ensures that the limits of error will not exceed L. The pYu- 
cediire is as follows. We first complete a number of replications vsome- 
what less than the number which we expect will be necessary. At this 
stage we compute the limits of error. If these are already less than L, 
the object is achieved. If they are greater than L, Stein gives a simple 
rule for computing the additional number of replications needed to give 
limits that are less than L. 

Naturally, these methods require more replications than tables 2.1 
and 2.2. In fact, if used repeatedly, they give a standard of aecurac'y 
which on the average is greater than we desire, in order to ensure ihiit 
the standard is seldom if ever less than what is wanted. For this rea- 
son, tables 2.1 and 2.2 appear to be suitable for ordinary experimenta- 
tion; the methods described here should be used when it is very impor- 
tant to reach a specified standard of accuracy. 

2,3 Other Methods for Increasing Accuracy 

2.31 The Measurement of Relative EfSciency* We have seen that 
the error variance of the difference between the means for two irc^af- 
ments is 2a^/r, where r is the number of replications. This result pj-o- 
vides a simple means for comparing, in concrete terms, the relative 
precision of two experiments. Suppose that one design gb^es a ti’ue ei'ror 
variance of 1.0 per experimental unit, while a second gives 0.5. The 
two experiments would give equally precise comparisons among the 
treatment means if the amount of replication in the first were tvice that 
in the second. From a Imowledge of the error variances per unit, com- 
parisons are thus easily made in terms of the relative amounts of replica- 
tion required .to attain the same degree of precision. The inverse ratio 
of the variances per unit is sometimes called the relative ejficicncy of the 
two designs; in the example above, the relative ejffieiency of the secuind 
experiment to the first is 2.0 or 200%. Similarly we might speak of the 
relative gain in efficiency as 1.0 or 100% in this case. These terms oc<*ur 
frequently in the literature on experimental designs. 

One further factor must be taken into account. For the same numbers 
of treatments and replicates, the number of degrees of freedom in the 
estimate of error changes with the design. This number enters into many 
of the uses to which the experimental results are put. When the ruiinbei’ 
of degrees of freedom becomes smaller, the limits of error for a true 
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difference are increased and the probability of obtaining a sigiiificant re- 
sult is decreased; in other words, the sensitiyity of the experiment is de- 
creased. 

Thus the numbers of degrees of freedom for error are relevant to the 
comparison of two designs. A change in design which decreases the error 
degrees of freedom as Avell as the error variance may not be advantageous. 
This leads to the question: by how much must the experimental error 
variance be reduced so as to balance a given reduction in the error degrees 
of freedom? The question has been discussed by Neyman et aL (2.1), 
Fisher (2.5), and Walsh (2.6). The ansAver depends on the use to AAhicli 
the results are put. From one point of vieAV, tAvo experiments may be 
regarded as equal in sensitivity when the limits of error for a given con- 
fidence probability are equal. Alternatively, equal sensitivity may 
imply an equal probability of detecting as significant a given real dif- 
ference betAA^een the effects of tAVo treatments. Other definitions could 
be set up. 

Table 2.3 illustrates the situation. An experiment Avith a true error 
variance per unit equal to 1 (i.e., AAdth an infinite number of degrees of 
freedom) was chosen as the standard. The table shoAvs, for a range of 
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values of the error degrees of freedom (n), the true error variances per 
unit Avhich are required to equal the standard in sensitivity according 
to four definitions of this term. In case 1 the average limits of error for 
a confidence probability .95 are made equal. In case 2 the same cri- 
terion is used for a confidence probability .80. In case 3 all experiments 
have the same probability .25 of detecting as significant a constant dif- 
ference betAveen two treatments. Case 4 shows the same comparison for 
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a larger constant difference where the probability is .75. 1 he tests ot 

significance are one-tailed il-tests at the 5% level. 

For example the figure 0.70 for n ~ & (definition 1) ine<ins that with 
a true error variance per unit of 0.70 and 6 d.f. in the estimate of error, 
the limits of error for a confidence probability .95 are the same on the 
average as when is 1 and n is infinite. A 30% decrease in error vari- 
ance is needed to compensate for the lack of degrees of freedom. 

The wide divergence between the results for definition 1 and defini- 
tion 2 is noteworthy since the only change in criterion is that from a 
confidence probability .95 to one of .80. Definitions 3 and 4 exhibit 
much closer mutual agreement and give results intermediate between 
those of definitions 1 and 2. In practical applications the discrepancies 
among the four methods will be smaller than table 2.3 suggests, be- 
cause comparable designs for the same experiment seldom differ greatly 
in the degrees of freedom available for error. 

Fisher’s approach (2.5) is somewhat different. In effect, he calculates 
the “ainoimt of information” which the estimated difference d between 
two treatment means supplies about the true difference 5, the calcu- 
lation being made from the f-distribution of the quantity 

He finds the information to be (n + l)/(n + whereas if (t were 
known exactly the information would be l/V^. The cstimat{.‘d varianees 
which give equal amounts of information are shown in table 2, L 

TABLE 2.4 Estimated variances which provide EQirAL Auovxm of 
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This table agreesS closely with the results for cases 3 and 4, c^xeept 
when n is below 5. For general purposes it is suggested that tins table 
be used to take account of the differences in degrees of freedom for error 
in two designs that are being compared. Suppose that a design with 
n 6 is compared with one in wdiich n « 12. The former gives more 
information only if the error variance si^ is less than ( 0 . 778 / 0 . 867 )^ 2 ^'. 
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Thus the relative ef&ciency of the design to the is estimated 
as 0.778 s2V0.8676‘i^ or more generally as 

(% + l)(n2 + 3)^2^ 

The adjustment is of importance only if ni and n 2 are small. 

This expression applies to the simple case in which the tw’^o experh 
ments have the same set of treatments. The experiments ma^^, however, 
have different sets of treatments, although each provides an estimate of 
some specific treatment effect in which we are interested. Alternatively, 
a different method of estimation of the treatment effect may have been 
employed in the two experiments. In such eases we may wish to assess 
the relative accuracy with which this treatment effect is estimated in 
the two experiments. The expression given above still applies, except 
that Si^ and S 2 ^ become the estimated variances of the treatment effect 
as found in the two experiments, wliile rii and n 2 are as before the num- 
bers of degrees of freedom in sff and ^ 2 ^ respectively. 

In succeeding sections a number of other devices which affect the 
accuracy of an experiment are presented brieflj^ 

2.32 Selection of Treatments. In certain cases the selection of the 
treatments has a substantial effect on the precision of an experiment. 
As Fisher has stressed, striking gains in precision may be achieved by 
testing diffenuit types of treatment in the same experiment, instead of 
conducting a separate experiment for each type. For instance, one 
type of treaiinent or factor might be the depth of ploughing m the 
pre^paration of a field for a crop, while another might be the addition of a 
nitrogenous fertilizer. We might conduct one experiment in which 
deep (D) and shallow (S) ploughing are compared, and another in which 
no fertilizing (0) is compared with the addition of a specified amount of 
fertilizer (A), In a factorial experiment both factors would be tested 
simultaneously by means of the 4 ^ffreatments^^ — (DO), (80)^ (ZW), and 
(aSW), where the symbol (DO) implies deep ploughing with no addition 
of nitrogen, etc. The average response to nitrogen is assessed by com- 
paring the last tW'O treatments with the first two. This comparison 
gives as precise an estimate of the average response to nitrogen as if the 
whole experiment had been devoted to that factor alone. The same 
property holds for the estimation of the average difference in effect be- 
tween deep and shallow ploughing. Thus the experiment, as it were, 
does double duty. The use of factorial experimentation is discussed 
more fully in chapter 6. 

There are many specialized problems where the choice of the proper 
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amounts of some ingredient is important. As a veiy example, 

suppose that the response to increasing amounts of the ingredient is 
known to be linear and that the purpose is to determine the slope of the 
line. The most accurate experiment contains only two amounts oi 
ingredient. These should be placed at the ends of the range within which 
the response is linear and experimentation is feasible. With three 
amounts of iiigredientj two at the ends and one in the niiddlej the 
variance of the estimated slope, for the same total size of experiment, is 
1 1/^ times, as large, and if more amounts are used it becomes larger still 
A more complex example occurs in the estimation of the ainount of a 
virus in a solution by pi^eparing a series of dilutions oi the soiiition and 
counting the numbers of lesions produced on leaves wdiich are rubbed 
with the dilutions (2.7). The accuracy of the estimate is known to 
depend both on the dilution ratio that is chosen and on the number of 
dilutions that are used. 

2.33 Refinements of Technique. Since teckmgue is tlie responBibilit.y 
of the experimenter, its importance need not be elaborated. The prin- 
cipal objectives of a good technique are as follow^s. 

i. To secure uniformity in the application of the treatments. In 
pig-feeding experiments, for instance, a uniform amount of ftjod (amnot 
be supplied to each animal without the provision of individual fec^ding 
boxes. In the testing of insect sprays, delicate apparatus is na.|iiir(H! 
in order to subject each butch of insects to a desired dose of spray. 

ii. To exercise sufficient control over external iuHuences so tliat 
every treatment produces its effects under eomparal)Ie and dasii*ed con- 
ditions. It is difficult to generalize about the degree of control needed; 
a balance must be struck between the cost incurred and the gain in 
precision obtained. The artificial production of diseases for experiment s 
on resistance to infection exemplifies a case wffiere experimentation can- 
not proceed rapidly without such control over external conditions. 

iii. To devise suitable unbiased measures of the effects of the treat- 
ments. Often the appropriate measurements are readily apparent; 
sometimes the development of a satisfactory method of rm?asurem(uit 
requires years of research, as in the estimation of (^eriaiii of the vitamins, 
in soil analysis, and in sociological investigations. 

iv. To prevent gross errors, from which no type of experiment at i«>n 
seems to be entirely free. Adequate supervision and cheeking of 
work of assistants and a scrutiny by the experimenter of tlie data from 
every experimental unit will go far towards the discovery and rectifica- 
tion of errors. 

Faulty technique may increase the real experimental errors in two 
ways. It may introduce additional fluctuations of a more or k'ss 
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random nature. Such fluctuations, if they are substantial, should reveal 
them>selves in the estimate of error as calculated in the analysis of 
variance (chapter 3). Where his estimated standard errors are con- 
sistently higher than those of other workers with similar material, the 
experimenter is advised to seek the reason, which may lie in differences 
in technique. In addition, faulty technique may result in measurements 
that are consistently biased. The estimate of error does not take ac- 
count of such biases, since it is derived from comparisons of the measure- 
ments with one another; in other words, it estimates precision rather 
than accuracy. The principal safeguards against such biases are care 
and skill in the construction and handling of measuring devices, plus the 
intelligent use of randomization. 

It is worth while to consider from time to time whether simplifications 
can be brought into the technique without undue loss of accuracy. 
Many experiments involve chemical determinations which vary little 
from unit to unit within the same treatment. For instance, in sugar- 
beet experiments, the experimental error of the amount of sugar per 
acre is due mainly to the error in the weight of roots per plot. The 
amount contributed hy the sugar content percent is nearly negligible. 

With compound measurements of this type, the labor devoted to the 
less variable component can be reduced with little loss of accuracy. If 
the ehemk*.al measurement is expensive, a considerable saving in cost 
may be made. Tests of significance of the chemical composition need 
not be sacrificed. For example, if the experiment has six replicates, a 
chemical analysis might be made for each treatment on bulked material 
from the first three replicates and another on bulked material from the 
last three replicates. These data provide a valid test of significance based 
on two “replicates.^' 

2.34 Selection of Experimental Material. The choice of the experi- 
mental unit may be of importance. In the planning of field experiments 
numerous studies have been made of the variability among crop yields 
on plots of various sizes and shapes under uniform treatment. From 
these data the best size and shape are selected. The criterion should 
be to obtain the maximum accuracy for a given expenditure of time and 
labor. 

Freciuently, uniform material is prepared specially for experimental 
purposes, as in the development of inbred lines by laboratories engaged 
in animal experimentation. Alternatively, the experiment may be con- 
fined to a sample chosen for its homogeneity from a large batch of ex- 
perimental material. If the results of the experiment are to be applied 
to unselected material, these types of specialization have potential dis- 
advantages, Responses obtained to the treatments in highly selected 
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material may not be the same as the responses in nnselected material. 

Selectivity in the material is difficult to avoid viiere experiments are 
conducted in the field of economics or sociology. In testing some method 
of farm management, for instance, the experunent may require the active 
participation of a group of farmers. The success of such experiments 
depends greatly on the tact and resourcefulness of the investigator in 
persuading farmers to cooperate so that the participants are a represent- 
ative sample of the population about which generalizations are to be 
dravTi. 

2.35 Additional Measurements. In the eoui-se of an experiment it 
may be possible to take supplementary measurements which predict at 
least to some extent the performances of the experimental units. In an 
experiment to measure the effects of diffeient diets on the weights of 
children, their w'eights at the start of the experiment would be a supple- 
mentary measurement of this kind, since the increase in weiglit of a 
child durmg the experiment is probably coi-related with his initial 
weight. 

By a technique known as the analysis of covariance, we may estimate 
from the data the extent to which the observations were influenced by 
the variations in these supplementaiy measurements. The average 
response to each treatment can then be adjusted so as to remove the 
experimental error arising from this source. In the school (?xperini(uit, 
the adjusted responses to treatments represent approximately the re- 
sponses that would be obtained if all the children had the .same initi!i] 
weight. Thus the effects of variations in the initial weights are largely 
eliminated from the experimental error, without the necessity of equal- 
izing the initial weights in the planning of the experiment. 

Since the analysis is a statistical technique of comparatively recent 
origin (it was first presented by Fisher about 1932), it may be unfamiliar 
to many investigators who could use it to obtain a substantial increa.se 
in precision with little effort. Its purpose is to remove experimental 
errors arising from extraneous sources of variation which if, is impractical 
or too costly to control by a more refined techni{iue. Cases an* fairly 
common where the use of covariance has more than doubled th(> <*fii- 
ciency of the experiment. The e.xtra work involved consists in tsikirig 
the subsidiaiy measurements and in applying the adjustments to the 
treatment means. An example of the computations is given in sei-fioti 

to minim i ze the experimental errors through the choice of the experi- 
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mental plan. The prospects of increased accuracy by this means have 
been widely explored during the past twenty years. This book is in- 
tended as a source of reference to the numerous procedures that have 
been developed. 

The basic idea is simple. Consider an experiment of which a number 
of separate replications have been conducted. The experimental errors 
of the restilts from any i^eplicate can arise only from sources of variation 
that affect the units within that replicate. Consequently, the error of 
the difference between two treatment means taken over a number of 
replicates must also arise solely from variations wifhm the individual 
replications. Variations from one replicate to another do not contribute 
to tlie errors. In canying out an experiment we utilize this simple fact 
bj?" trying to control sources of valuation that affect different units in the 
same replicate. We need not attempt to reduce differences among the 
replicates. For example, if the experimental units form a very heter- 
ogeneous batch, we attempt to group them so that units in the same 
replicate are similar; we do not worry if the units in one replicate are 
not at all similar to those in another. By this device, precise experi- 
ments can often be made from what at first sight appears an unpromising 
batch of material. Similar^, if a uniform experimental technique cannot 
be maintained throughout the experiment, the important point is to 
keep the technique uniform within a replication; changes should be 
made when moving from one replicate to another. This type of design, 
known in agriculture as randomized blocks j is discussed in section 4,2. 

The idea is carried a stage further in the latin square (section 4.3). 
The treatments are arranged diagrammatically so that each appears once 
in every ro^v and once in every column of a square array. Variations 
among the groups of experimental units which correspond to the rows 
and also among those which correspond to the colimms are eliminated 
from the experimental errors. For example, in a comparison of 5 dif- 
ferent w^ays of performing some manual operation, the rows may repre- 
sent 5 operators, each of whom carries out all the methods in tium. The 
columns, numbered from left to right, may prescribe the order in which 
each operator perfoims the 5 tasks. The plan of a latin square design 
for this experiment might read as follows. 

Order of procedure 


Operator 

(1) 

(2) 

&} 

(4) 

(5) 

1 

D 

A 

B 

E 

C 

II 

A 

B 

€ 

D 

B 

III , 

B 

E 


C 

D 

ly 

€ 

D 

B 

B 

A 

V 

B 

C 

D 

A 

B 


34 


METHODS FOE INCREASING ACCURACY 2.36 

The letters 4, J5, C, D, i? represent the 5 methods. Thus operator I 
carries out method A next, and so on. ^ With this ar- 

rangement, the experimental errors are unaffected by differences among 
the abilities of the men and also by systematic variations introdu(3ed by 
the order in which the methods are performed. If the opportunity 
presents itself, this type of control may include a third factor by the use 
of a graeco-latin square (section 4.5). 

2.37 Planned Grouping : Incomplete Blocks. In the randoinixed block 
design, the precision depends to some extent on the number of treat- 
ments. As this number increases, it becomes more difficult to keep the 
variations between experimental units within each replicate small, and 
the error variance per unit tends to increase. In experiments with a 
large number of treatments, numerous attempts have been made to 
avoid this loss of precision by the formation of groups of experimental 
units which do not contain all the treatments and therefore can remain 
small These groups are called incofriplete blocks. The groups are so 
constructed that we can still remove the effect of the differen<3es among 
groups from the experimental errors, just as such differences are auto- 
matically removed in the randomized blocks design. With incomplete 
blocks, however, this removal usually requires the application of adjust- 
ments to the treatment means, so that the statistical analysis of the re- 
sults becomes more complex. 

The best method of constructing the incomplete blocks varies witli the 
nature of the experiment. Sometimes certain comparisons among tivaU 
ments are of greater interest than others. For instance, in a field experi- 
ment containing a number of early-maturing and an equal number of 
late-maturing varieties of a crop, comparisons between pairs of varieties 
which mature at about the same date will presumably be more important, 
than comparisons between an early and a late variety, jin incomplete 
block might contain either all the early or all the late varieties, a pair of 
blocks forming a complete replication. This arrangement, of which 
severd.1 variations are possible, is known as the split-plot design (section 

A similar situation occurs in factorial experiments. Certain com- 
parisons, e.g., the average effects produced by varying one factor, iim 
nearly always of immediate interest. Others, which reprc^^sent rather* 
complex relationships among the effects of dUTerent factors, may bo of 
minor importance, often because previous experiments have shown that 
these interrelationships, called interactions^ exeit little or no mfiuenee. 
It has been found that, by a deliberate sacrifice of precision in the 
estimates of these interactions, the size of the group or block can bo re- 
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diiced BO as to produce an increase in the precision with which the average 
effects of the factors This principle, known as con- 

fotmdmgy can also be combined with the use of the latin square (chapters 
6 and 'S), ■ ■ 

If all comparLsons bet\?een pairs of treatments are potentially of equal 
importance, a different method is used in forming the blocks. When 
treatments are arranged in incomplete blocks, two treatments which oc- 
cur in the same block are more precisely compared than two which are 
placed in different blocks, at least in so far as the investigator has suc- 
ceeded ill decreasing the variation within blocks. Therefore the goal is to 
construct a design such that m?/ %^air of treaimsnis occurs equally often 
within some block. This type of problem has been investigated pre- 
viously for its mathematical intere>st. A solution can be found for any 
number of treatments and any sfae of block, but most of the solutions 
require too many replications for the usual conditions of experimentation. 
The solutions available to date {called balmiced mcomplete Mock desigiis 
(chapter 11)) are given for ail cases in ivhich the number of replications 
does not exceed ten* Here again the latin square principle can some- 
times be used (chapter 13) to eliminate variation amongst two different 
types of grouping of the experimental units. 

For a given nmnber of treatments and a given size of incomplete 
blo(*k, balanced designs allow little choice in the number of replications. 
Tlius, with 64 treatments and blocks of 8 units each, 9 replications are 
rcciuired to balance the design. In order to provide designs for small 
numbers of replications, a number of additional types have been de- 
veloped; these are similar to balanced designs except that they lack 
complete symmetry. 

One group, called lattice designs (chapter 10), has been used widely 
in agricultural experiments during the past 5 years. For these designs, 
the number of treatments must be an exact square, e.g., 25 or 49, while 
the number of units in the block is the corresponding square root, 5 or 
7, respectively. A further group, known as cubic lattices (section 10.4), 
is useful when the number of treatments exceeds 100 yet it is desirable 
to have a small block. In this case the number of treatments is the cube 
of the number of units per block, and the number of replications may be 
any nmltiple of 3. 

2.38 Summary, There are numerous methods available for increasing 
the accuracy of an experiment. Frequently the same end may be 
readied in several different ways. Thus, in the experiment which tests 
the effects of a number of diets on the weights of children, the disturbing 
effects of variations in their initial weights can be reduced by selecting 
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for the experiment a number of children of approximately the same, initial 
weight, or by the use of the analysis of covariance, or by a planned 
grouping in which children within the same replicate are of approxi- 
mately equal initial weight, or simply by having a large numljer of 
replications. The method adopted should be that for which the deKsired 
standard of accuracy can be attained with the smallest expenditure of 
time and effort. There is no special merit in either a complicated ex- 
perimental plan or a highly refined technique if equally accurate results 
can be secured with less effort in some other way. A good working rule 
is to use the simplest experimental design that meets the needs of the 
occasion. This is not to vSay that the more complex designs will be used 
only rarely; in fact, they have already demonvstrated their utility in 
fields of research where no other equally practicable method appears to 
exist for reaching the same results. 

2.4 The Grouping of Experimental Units 

2.41 Investigation of Methods for Grouping, In order to take full 
advantage of the opportunities for increased precision by suitable group- 
ing of the units, the investigator must know the best criteria for group- 
ing. Frequently these are suggested from previous o]:)servations on th{3 
experimental material or on disturbing factors that become evident in 
the course of an experiment, ^Yhere knowletlge is less definite, us in a 
new line of research, there are two ways in which informatimi may be 
accumulated. 

One is to devote an experiment specifically to this problem. In tins 
case it is usually advisable to subject all units to the same treatment, 
carrying out a so-called uniformity trial The word uniformiUj refers to 
uniformity of treatment; the experimental material and technique should 
be representative of those which are used in actual experiments. The 
amount of material should be sufficient to enable the experimenter to 
superimpose on the results a hypothetical trial of the size wliieh he cus- 
tomarily performs. It is worth while to record any auxOiaiy measure- 
ments which might predict the perfomiance of the experimental units. 
From the results of these trials any proposed grouping of the xuiits <?an 
be formed and the amount of variation within and among the groups 
can be calculated. If it is desired to compare different experimentul 
plans, each may be superimposed on the results of the uniformity trial. 
By the method indicated in section 2.31, the investigator may estimate 
the relative numbers of replications that are needed to reach the same 
degi^ee of precision with the various plans. , 

Secondly, useful information can sometimes be secured without any 
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special trial or technique by examination of the results of actual experi^ 
mmtH. With an experiment which is suitably designed, it is possible to 
estimate fi’om an analysis of variance of the results what the error vari« 
aiice would have been if any particular grouping had not been used. This 
estimate, being derived from the same data, is directly comparable with 
the error variance obtained by the use of the grouping and enables the 
investigator to evaluate the success of the grouping. Consequently, for 
experiments in a new line of research, the experimenter may try any 
method of grouping that migM he effective, in the knowledge that its 
appropriateness wvill be revealed by the experiments themselves. In- 
structions for performing such comparisons, which are easily made when 
the analj^sis of variance has been completed, are given in the notes which 
accompany each type of design. 

2.42 Criteria for Grouping, Many factors have been used as the basis 
for grouping. In agricultural experimentation it has been repeatedly 
demonstrated that plots close together tend to be more similar in their 
yields than plots farther apart. The almost universal practice is to put 
neighboring plots in the same group, with the groups approximately 
square in shape whenever practicable. Contiguity frequently forms a 
good basis for grouping in greenhouse experiments also, where dilferences 
may exist along the bench in temperature, sunlight, air currents, or ac- 
cessibility for WTitering. Alternatively, in many plant experiments, 
and more particularly in animal experiments, some characteristic of the 
plant or animal is much more useful, the physical location of the ex- 
perimental units throughout the course of the trial having a relatively 
minor influence on the results. Age, w’-eight, vigor, sex, and genetic con- 
stitution are some of the factors most commonly used. The animal it- 
self may constitute the block in cases where the treatments can be applied 
in succession to the same animal wdthoiit producing residual effects which 
obscure the results. 

A single grouping, as in the randomized block design, can be used to 
eliminate simultaneously variation from a number of different sources. 
For example, suppose that an experiment w^ere planned to investigate 
w^hether some treatment applied to rats enabled the animals to wdth- 
stand subjection to a dose of a poisonous gas. The replication w^ould 
contain two animals, a treated rat and an untreated rat to be included 
as a control Given a sufficient stock of experimental animals, the pairs 
in any replication might be animals of the same litter (and consequently 
of the same age), of the same sex, and of approximately the same weight 
and vigor. If it were convenient to test two animals at the same time, 
the ]pair could be put into the gas chamber together, thus eliminating 
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also the effects of variations in the strength of the dose or in the time for 
which it was supplied. 

The freedom with which the size of block can be varied depends on 
the type of research. In field experiments the block can be laid down so 
as to accommodate any number of treatments, though, as preAuousIy 
stated, the standard error per plot may be expected to increase slowly as 
the size of block increases. TRdien the use of designs with incomplete 
blocks is contemplated for field experiments, the principal question is to 
decide for what number of treatments the reduction in block size be-^ 
comes worth while. On the other hand, the number of leaves on a plant, 
or of animals of the same litter, usually varies Avithin rather nanxnv 
limits. In such cases, where the most homogeneous grouping fixes the 
size of the block, incomplete block designs may be of special interest. 

The gains in efficiency obtainable by skillful construction of tlie 
blocks, when measured in terms of increased replication, are often large; 
examples can be cited in which tlie effective replication has been in- 
creased from two- to tenfold. Frequently, apart from the preliminary 
care and thought involved, an accurate design uses the same material 
and involves the'same physical operations as the design which it replaced. 
In some of the more complex designs the computations reqiiircKi to es- 
timate the treatment averages and to perform tests of significance be- 
come rather laborious and immlved. The experimenter's reaction to 
these difficulties \yiU depend on his aptitude for numerkal computtitions. 
At a first trial considerable time may be spent in mastering Home of the 
methods, but the processes become easier with familiarity. 
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CHAPTER 3 


NOTES ON THE STATISTICAL ANALYSIS OF THE RESULTS 

3.1 Introduction 

With regard to the statistical analysis for a design, the procedure 
adopted in this book is to present the computations, ■\’^dth worked nu- 
merical examples for the designs likely to be most frequently used. To 
supply a complete mathematical justification of the analysis for each 
type of experiment, although in many respects desirable, would have 
unduly increased the amount of material. The same general technique 
governs the analysis of all designs, differences arising only in the adapta- 
tion of the technique to the particular structure of a design. In the 
present chapter this technique— the method of least squares — is de- 
scril)ed and applied to one of the simplest designs. With perseverance 
the method may be used to verify the computing instriictions for other 
designs in this book. It should be pointed out that the computing meth- 
ods that iire easiest in practice are not necessarily those that flow from a 
straightforward application of the theory, so that at first sight some of 
the instructions may appear different from those given by theory. 

Some account will also be given of certain statistical techniques, such 
as the technique for missing values and the analysis of covariance, that 
are particularly useful in handling the results of experiments, though 
they are perhaps too specialized to be included as part of a general in- 
troductory course in statistics. 

3.2 The General Method of Analysis 

3.21 The Mathematical Model. The illustrative data come from an 
experiment conducted in 1935 by the Rothamsted Experimental Station 
(3.1). The object of the experiment was to measure the effectiveness of 
4 soil fumigants in keeping down the mimbers of eelworms in the soil. 
The fumigants were chlorodinitrobenzene (ON), carbon disulphide jelly 
(CS), and two proprietary preparations, ^^Cymag’^ (CM) and ‘‘Seekay^^ 
(CK). Each fumigant was tested both in a single and a double dose. 
These comprize 8 distinct treatments. The control (no fumigant) was 
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the nmtli treatment. Normally the replication would have consisted 
of 9 plots, but in this case 4 control plots were placed in each ''replicate/' 
which actually contained 12 plots. The purpose was to supply a fairly 
accurate standard against which to measure the performance of the 
fumigants. 

The experiment was laid out in 4 separate blocks of 12 plots each, so 
that there were 4 replications for each dose of each fumigant and 16 
replications of the control Within a block the 12 "treatments" were 
allotted to individual plots at random. The experiment is of the 
familiar type laiown in agriculture as randomized blocks (see chapter 4). 
The fumigants were ploughed in during spring, after which a crop of oats 
was sown. After harvest, a sample of 400 grams of soil was taken from 
each plot and the number of eelworm cysts counted for eacli sample. 
The data arc those labelled "second count" in table 3.1. 


TABLE 3.1 Plan and numbisbs of eelworm cysts per 400 grams op 

SOIL, FIRST COUNT ABOVE, SECOND RELOW 



^ In a formal analysis of the results the first step is to set up an eqiia- 
tion for every observation. This equation expresses the observation as 
the sum of four components: (i) a general average about which the 


i 


m2 ASSUMPTIONS the ' 

observatioHS are presiimed to be fluctuating; (ii) a component repre- 
senting the effect of the treatment applied; (iii) a component represent- 
ing ceiiain emuromnenta^ effects which the design of the experiment 
enables us to isolate; and (iv) a residual component, representing all 
other sources that influence the observation, and generally referred to as 
the “expermumtal error.’’ 

Component (iii) cannot be fully appreciated without some knowledge 
of the teclinic|ue employed in the construction of designs. This technique 
consists in arranging the conduct of the experiment so that, in the sub- 
sequent analysis of the results, dilferences among certain groups of ob- 
servations can be measured. What is more important, the effects of such 
differences can be eliminated from the estimates of the treatment effects. 
If this elimination wea^e not feasible, these diffemnces would contribute 
to the ^^experimental error,” component (iv), with the consequence that 
less accurate estimates of the treatment effects would be obtained. In 
the present example, as will be seen, these differences are differences 
among the groups of 12 observations that constitute a block. With 
each design that we describe, the environmental components that are 
eliminated in this way are pointed out. 

We proceed to waate down the mathematical model as applied to the 
example. Instead of numbering the observations from 1 to 48, it is con- 
venient to use a Biibscu’ipt i to denote the treatment applied, j to denote 
the block in whkdi tlie observation lies, and h to denote the order within 
the blocdc. For the plots that receive fumigants, the subscripts i and j 
are sufficient to define each plot uniquely. The subscript k is needed to 
distinguish between different control plots within the same block. In 
this notation the equation for any observation may be written 

Vijk = g + ri + + Oijk (3.1) 

where m represents the general mean, the effect of the treatment, 13^^ 
that of the block, and eijk the experimental error, 

3.22 Assumptions Made in the Model. There is already implicit in 
the model one assumption that may not be true in the data obtained 
from experiments. This assumption is that the treatment effect and the 
block effect are additive. If we take the difference between two observa- 
tions in the same block, say and have, as a deduction from 

the model, 

yijhi - y2M == ri - T2 + (^•^) 

The block effects have disappeared from the right-hand side; that is, 
the model implies that the difference between the true effects of two 
treatments is the same in all blocks. On a block with a high eelworm 
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infestation a successful fumigant is presumed to reduce the eelworm 
numbers by the same amount as on a block with a low eelworm infesta- 
tion. This is unlikely to be so, except as a first approximation. Little 
systematic study of the accuracy of this approximation in practice has 
been made. From general experience it appears that the approximation 
works well in a large number of experiments; nevertheless, cases are not 
infrequent where the approximation is poor. 

In addition, several assumptions are made about the residual effects 
Cijjc. These are taken to be independent from observation to observation, 
and to be distributed with zero mean and the same variance Further, 
for tests of significance and the estimation of confidence limits, the e^jk 
are assumed to follow the normal or Gaussian frequency distribution.' 
In practice, these assumptions are only approximately fulfilled. The 
consequences of errors in the assumptions will be indicated later (section 
3.9). 

Apart from changes in detail, the type of model and assumptions 
presented for this example apply to all experimental arrangements in- 
cluded in this book, except for two types of design for which a more 
complex model is required. These are the split-plot dmgnB ((fiuipter 7), 
where two or more different error variances are postuiated, and wi- 
complete block designs (chapter 9), where additional assumptioiLS are made 
about the nature of the environmental effects ft-. 

3.23 Estimation of the Treatment Effects. When the assiiiiiptions in 
the previous section are valid, there is a well-known result in statistical 
theory that the best estimates of the iinkiaowns /u, are obtained 

by the viethod of least sq^iares. This method chooses estimates m, i/, and 
bjf respectively, which minimize the sum of squares of the residuals 

L (2/iift - m - t{ - bjf (3.3) 

taken over all observations. 

In order to find these minimizing values we differentiate the sum of 
squares with respect to each unknown in turn, and vSet the dc^rivative 
equal to zero. Consider a specific unknown, e.g., li. If any minima con« 
tains tif its derivative is 

-^(Vm - ni -ti- bj) 

If the square does not contain ti^ the derivative is zero. Hence, if the 
derivative of the sum of squares in (3.3) is set equal to zero, the resulting 
equation may be written 
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where the sum is over all observations whose equations contain or 
more generally over all observations whose equations contain the param-- 
eter to bo estimatod. 

E(jiiations (3,-1) are called the equations of estimation, or the normal 
equations. As we have- seen, the equation of estimation for any param- 
eter is constructed by equating the total of the observed values yij], to 
tlie total of the expected values {ni + ti + ?>i) over all observations whose 
equation contains the parameter. 

We now write down some normal equations for the example. Since 
m occurs in all 48 observations^, its normal equation is 

48?n "b 4:itt q- ^2 * rd” fe) Hh lbi59 *b 12(&i d- ^2 d" * * * "b ^ 4 ) ~ G (3,5) 

where % denotes the control treatment, and G the grand total of all 48 

observations. At this point a simplification may be introduced. In 
the model the quantities f-t are needed only to indicate by how much an 
individtial treatment differs from the average of all treatments in the 
experiment. Accordingly the model is not changed in any essential 
feature if we assume that the mean of all the ti is zero. It may also be 
shown that any convenient linear function of the U may be assumed to 
bo zero. The same remarks apply to the block values hj. From the form 
of equation (3.5) it appears that convenient assumptions of this kind are 

h “b fe d b === 0 (3.6) 

hi -b &2 d“ ^^3 d" ^4 = 5 = 0 (3.7) 

With these assumption.^ equation (3.5) reduces to 48wz - (?, so that 
m becomes the mean value of all observations in the experiment. 

By summing over the observations that receive a particular treat- 
ment ti we find that its normal equation is 

4???- -b 4:1% d” hi + ho d" hg d“ ^>4 = Ti (?* = 1,2, * * • , 8) 

and for the control, 

“b 16^0 "b 4(hi d“ ^2 ”b ha d“ ^ 4 ) ^ Tq 

where the 2”s denote observed treatment totals. From relation (3,7), 
these equations reduce to 

^ - m (ii = 1, 2, • • ■ , 8); is = ^ ~ m (3.8) 
4 16 

As might be expected, the effect of any treatment is estimated by 
taking the difference between the mean of the observations which re- 
ceive that treatment and the general mean. 
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j nf thev original iiioclel (o.l). 

Let us express these estimates m leum ot • g 
By adding the relevant equations we hnd _ 

f. = 4,1 + 4n- + (ft + ft + ^3 + ft) + ^ 

Hence, using (3.8), ' , o \ j. s. 

„ + i,-, + r,+w. + fc + '*» + *(+;' , 

Ww.i1 .r d,Al«e ^ 

net (iii). This reelt enemplto “ Ww«„ the ot- 

solutions, namely, that the es l^y the envivoiimenttil ei- 

fects of any two treatments is no ' - j,gy 
fects, but only by the residual ‘Jr ps^timates of the treat- 

that environmental effects are ehmmatcd fiom the 

inent effects. . offects are usually of less 

Estimntes of the bloek « ft » 

interest. It may be verihed that the least s.iua 


Bj 

12 


m 


where Bj is the obseiwed bloek totoul. 

3.24 Tests of Significance and Co^dence L^ts^ V guLsequent 

of the treatment effects 

study of them vaiy with yp ■ * conclusions: 

someofthe Mowmgprocecto ^ ‘ ^ the effects r, and 

(i) a test of significance generally, of the hypothesis 

Tu of two treatments are iden . , , construction of 

that some Imear fr- - tJ) between the effects of two 

confidence ^ f unction of the treatment effects; (iii) a test 

treatmente, wfor U that a group of treatments ri, 

£g ttei procedures is presented briefly, without proof. Ihe anth 
metical methods are described m the next section. 


82i TESTS OF SIGNIFICANCE AND CONFIDENCE LIMITS ^5 

j. fcsf of doHfJkmce of ths difference hekveen two teatment e^ffeck. 

From (3AJ) we have 

ti — tu = in - Tu) + (Si - e„) (^-9) 

t he o’s are normally distributed with zero means, it 
y^cory that (b- - Q is normally distributed about the true dtference 
' — Further, the variance of (tj — fu) may be s lown 




,vhere r. and are the numbers of replications of U and tu, 

I- ~h\ - useful to be able to calculate the variance of any compaiison 

iSer:i^ .■»!«. ^ doin. this ..-. given .n secUon 

in (..‘iei- to proceed we need an estimate 

variance From theory, the best estimate is known to be 

■ , , Tie : , 

wwidoal sum of s.,ua.« is ‘“XS (d f 

divisor », , culled the fmmher of degrees of fieedom {d.t.) m 

F;-,-,„al mnul«r„tebse,wations) - (number of .ndependent 

parameters that ivere estimated) 

is introduce he«« ™ 

■m the original model f »'» P*™™ lini relation (3.6) 

the example wc found tha we _ Consequently the number of 

among the I’s and one (3.7) among the b s. conseque, y 

independent parameterscis^^^^^ : ^ _ j.) \ ■ to 

1 (for r«) + 8 (for the t’s) + 3 (for the b s) - 12 
Since there are 48 observations, n. is 36. _ ^ ^ 

not.:;::;;;; S mr »» ani estUted valance 


s2 (I 4- L\ 

Vr,- fu/ 


The ratio 


ti tu 
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is iaiown to follow. Student’s if-distribution with ite, or 36 d.f . This is the 
quantity used for a test of significance of the null hypothesis. 

With a more complicated function of the estimated treatment effects, 
say L = {witi + w^k. + -- -+WA), where the w’s are any set of num- 
bers, the procedure is essentially the same. The estimated variance of 
L is sV, where / is a numerical factor found by the rules given in section 
3.5. For a test of the null hypothesis that the true value of L is zero, we 
use the mtio L/sVf, which is distributed as Student’s t with tie degrees 
of freedom. 

n. The consfructim of cOnfid&ice limits. The theory for a test of sig- 
nificance also leads to confidence limits for the unknown true difference 
(r,- - %). For, when t,- and are not necessarily equal, the quantity 


li'i tu) (r, tu) 



may be shown to follow Student’s /-distribution with «« degrees of free- 
dom. Hence the confidence limits are given by 


(n- — r«) ~ iu — /«) dr Sf/(n«, cv) / 1 

V n- r„ 

where d(nc, «) is the value of Student’s t corresponding to /q- degrees of 
freedom and the chosen confidence probability a. A similar method is 
used for the more genei’al function L. 

Hi. A test of the idetitity of a group of treatment effects. Quite frequently 
the first test to be made is that of the hypothesis that all k treatments 
have produced identical effects, i.e., that all n- are equal. In other eases 
we are more interested in the hypothesis that some subgroup of the treat- 
ments, say treatments 1 to p, has produced the same effects. We con- 
sider the latter test, since it reduces to the first test if p is put equal to k. 

This test is provided by a general theorem that has many applications. 
First, we find the least squares estimates of all treatment and environ- 
mental effects in the usual way, and compute the residual sum of squares 

'Si® == £ (t/ijk ~ m ~ ii “ 6/)® 

Next we start again and rewrite the mathematical model, inserting the 
restriction that ti = rj = • • ■ = tp. That is, wherever any of these 
Ti appears in the equation for an observation, we replace the t,- by a 
common symbol, say r'. The least squares estimates are computed for 
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this iww sot of eqiiMtions, using the same obseryations. Again we com- 
]3utc*- the residual sum of squares which may be denoted 

' ^ JliVvk -nif - t' -b/f 

This will ahvays be found to be at least as large as >Si^. Finally, the 
tlieorem states that, if the null liypotliesis is true, the quantity 

P'— '1 :: ' fte ■ 

follows SnedecoF's F-clistributkm with {p — 1 ) and 7ie degrees of freedom. 

Although tlie i^rocediire may appear rather complex, the test criterion 
used is a reaBoiiable one. Si^ is the sum of squares of deviations of the 
observations from the values predicted for them by the original model, 
so that it measures how closely the original model agrees with the data. 
S2" plaj^s the same role for the restricted model. Consequently, if 82^ is 
much larger than we are inclined to think that the restricted model 
does not lit the data nearly so well as the original model, and therefore 
to reject the null hypothesis that the restricted model is the correct one. 
Ilowever, the value of — Si^) alom does not provide a measure of the 
improvement in the fit with the original model. If, for instance, this dif- 
ference is 10 , then, other things being equal, we should regard the im- 
provement in fit as greater when 81^ is 1 than when 81 is 1000 . It is this 
type of ctmsidenition that leads to the use of the ratio (^2*^ Bi)/8\ 

as the essential part of the tost criterion. 

Although the theorem postulates two separate sets of least squares 
solutions, nearly all designs are constructed so that only one set of solu- 
tions must be found in practice. A number of designs which could have 
been included in this book 'were omitted because the least squares 
analysis seems too cumbersome for frequent use. 

3.25 The Analysis of Variance, All the procedures (M;ests, F-tests, 
and construction of confidence limits) use the residual sum of squares, 
which will often be called the error sum of squares. This quantity could 
1)0 found by calculating for each observation the value {771 + k + h^) 
piedicd-ed by the least scpiares solution. The sum of the squares of the 
differences between observed and predicted values could then be ob- 
tained. This method is skwv, and the error sum of squares is much more 
quickly computed by a technique known as the analysis of variance. 

In the original model, each observation is represented as the sum of 
four components due respectively to the general mean, the effect of the 
treatment, the environmental effect, and the residual effect. In the same 
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way the analysis of varianee partitions the sum of squares of the ob- 
servations into four sums of squares, one attributable to the general 
mean, one to differences between the estimated effects of the treatmezits, 
one to the environmental effects which the experiment is capable of 
measuring, and lastly one which is the residual or error smn of squares. 
In most eases we compute the original sum of squares and the first three 
components, obtaining the error sum of squares by subtraction. 

The analysis of variance provides much more than a short-cut method 
of securing the error sum of squares. The sum of squares due to treat- 
ments is the quantity -- Si^) needed for the F-test of the hypothesis 
that no differences exist between the effects of the treatments. By a 
slight extension the analysis also supplies the sum of squares required 
for testing the equality of the effects of a subgroup of the treatments. 
The component due to environmental effects enables us to estimate b}^ 
how much the accuracy of the experiment has been increased by eliminat- 
ing these effects from the estimates of the treatment means. 

The analysis of variance depends on a number of alge^braic relations 
which will be illustrated for the eelworm experiment. From section 3.23 
it will be recalled that the least sqixares normal equation for any un- 
known, say U, was 

'E, (j/ijk - m ~ ti ~ hj) = 0 

summed over all observations which received the fth treatment. If we 
multiply this equation by U and add the equations for different treat- 
ments together, we obtain the equation 

^0 (3.10) 

where the sum is now over all observations, since every observation is 
associated with one and only one U. Similarly -we may establish the re- 
lations 

EMym ~ m ~ ii -b,) === 0 {3.11) 

' ^ ti hj) ^ 0 (3.12) 

where both sums extend over all observations. 

A few more relations of this type are required. If we add the estimates 
k over all plots in the jth block, the siun is 

+ %+••• + ts) + itQ 

But this is zero because of equation (3.6) in which a linear relation among 
the t-B was introduced. Hence, if we multiply by bj and add over all 
blocks, we establish the relation ' 

= 0 (3.13) 
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where the sum is again over all observationB. By similar arguments we 
prove the additional relations 

^mbj=^Q; (3.14) 

These relations lead to the partition of the sum of squares of the ob- 
servations, Write 

mjk ^ ni + ti + bj + iyijk - m - ti 

Square both sides, and add over all observations. The six relations 
(3.10) to (3.14) show that all six sums arising from cross-product terms 
on the right-hand side add to zero. Consequently, we have the following 
analysis of sums of squares. 

Huifl? - L + S ii' + X + X - TO - t-i - hjf (3.15) 
This equation is the basis of the analysis of variance of the results. 



3*26 AppHcation to the Example* To obtain the left-hand side of 

(3.15) in practice, we simply compute the sum of the squares of all ob- ; 

servations. For the eelworm data this will be found to be 5,481,198. 

The components due to the mean, to treatments, and to blocks are not 
usually calculated from (3.15) as it stands, because it is quicker and more 
accurate to obtain them from totals rather than fx’om the estimated ef- 
fects. ' 

Thus, for the mean, the contribution is 48?7i^, since there are 48 ob- 
seiwations. If 0 is the grand total of all observations, 14,680, this con- 
tribution may be written (?^/48, or (14,680)^/48, which amounts to 
4,489,633. This term is sometimes called the correction for the mean. 

Written in full, the sum of squares for treatments is 

4(q^ + + • • * + t^) + 16fe^ (3.16) f 

But from (3.8), f 

(i = 1, 2, 8); i9=™~TO i 


where Ti is the observed treatment total. By substitution, (3.16) be- 


-im - 4to)2 + (T 3 - 4m)2 + . . . + (n - 4 to)2] + tV (^9 ~ 

When each parenthesis is expanded, we have 

TJ 4- + 1- Ts® Tn^ 

±-±. + -L-2m{Ti + Ts + ---+Ts + Tg) + 4:Bm^ 

4 16 - 
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Now the sum of the observed treatment totals Ti is the grand total (?, 
or 487n, Consequently the last two terms above may be amalgamated to 
give 

+ ( 3 . 17 ) 

:: i 16 

Note that the last term is the correction for the mean, already found to 
be 4,489,633. Expression (3.17) is an example of the moj-e general ex- 
pression 

k rp 2 

X — -C 

which gives the treatments sum of squares when the ^th treatment is 
replicated times, C being the correction for the mean. 

To obtain the numerical value we first calculate the treatment totals 
as showm in table 3.2. 

TABL.E 3,2 Treatment totals from table 3,1 
■Level of ■■ ■ ■ 


applicatioB 

' :ON: ■ 

cs 

CM 

GK 

Totals 

0 


585S 


5858 

1 

1066 

628 

1431 

S02 

4317 

2 

1265 

877 

1241 

1122 

4505 


The treatments sum of squares is given by 
(1066)2 + (1265)2 + . . .+ (1122)2 ^ (sgsg)^ 


- 4,489,633 = 157,448 


The blocks sum of squares, ^^n likewise be expressed in terms 

of block totals as follows. 


> 42 ■ '^^'4 


-T 


(4383)2 + (3075)2 + (4752)2 ( 2470 )=^ 


4,489,633 « 289,427 


where the observed block totals have been inserted from table 3.1. The 
divisor 12 is the number of observations per block. 

The complete analysis of variance is shown in table 3.3. It is not 
customary to display the correction term for the mean, since this is of no 
particular' interest. The item labelled “total” at the foot of the table is 
the original sum of squares, minus the correction for the mean (5,481,198 
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ft: 

I 


— 4j489,633 = 991,665). This may be shown to be equal to the sum of 
squares of deviations of the observations from their mean; i.e., 

X Vii^ “ 48m2 = 2] ivm ~ 

TABLE 3.3 Analysis op vaeiance op the data in table 3.1 



Degrees of 

Sum of 

Mean 


freedom 

squares 

square 

Source of variation 

(d.f.) 

(s.s.) 

(m.s.) 

Treatments 

8 

157,448 

19,6<81 

Blocks 

3 

289,427 

96,476 

Error (by subtraction) 

36 

544,690 

15,130 

Total 

47 

991,565 



The column which has been computed above is, of course, the column 
labelled ^ ‘sum of squares.” The other two columns remain to be ex- 
plained. The ‘^degrees of freedom” associated with any component are 
the number of independent parameters required to describe that com- 
ponent in tlie model. In the case of treatments, this always equals one 
less than the number of treatments, and similarly for blocks. For the 
total S.S., the number is the number of observations (48) less one repre- 
senting the contribution of the mean. 

The degrees of freedom have two principal uses. First, by subtraction, 
the3^ giv’e the degrees of freedom for error (36). As mentioned in section 
3.24, this is the divisor needed for the error b.s. in order to estimate the 
error variance The estimate is 544,690/36, or 15,130, which is called 
the error mem square and is shown in the right-hand column of table 3.3. 
Second, the degrees of freedom for treatments are used in an /■'Ltest of 
the hypothesis that all k treatments produced the same effects. In 
section 3.24 the value of F for testing this hypothesis was given as 

k 1 He 

The denominator Si^/rie is the error m.s. In the numerator the quan- 
tity { 82 ^ Si^) may be shovm to be the treatments s.s., while {k — 1) 
is the number of degrees of freedom associated with treatments. Thus 
the numerator is 157,448/8, or 19,681, called the treatments mean square: 
The F-ratio for this experiment is 19,681/15,130, or 1.30, with 8 and 36 
d.f. 

In the same way the mean square for blocks forms the numerator for 
the F-test of the hypothesis that all block effects are identical. Although 
this test is rather seldom of interest, the mean square for blocks has other 
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^ By fomute which ii-or yarim-® that wouU lave 

mean square leads to an e^ ^ not been grouped into blocks. 

been obtained if tbe ei^r^^t b 

With some designs the strucw ^ ,s that tlie 

complicated than in JJ^^trtSions to the sum of squares are 
treatment and environmental co ^ different result is ob- 

entangled. If we calculate the tre^ blU effects care present or 
tained according to ^ In such cases the usual procedure is 

absent, and vice versa for b ^ i ^j^en treatment ettects are ig- 
to present the blocks s.s. as ndien block effects are taken 

nored, and the treatments called “blocks, ignoring treat- 

into account. These items ai . ^ . iji^cks,” respectively. If calcula- 
ments” and “treatments, ^Icrms may still be added so as to 

tions_ are made in ^^“^^bteLtion; further, the treatments s.s. is stil 
obtain the error s.s. ^ ^ treatment effects, 

the appropriate one for “ . f variance is necessarily verj' m- 

This presentation of the ^ should be made to Fisher 

adequate. ^ fo ^chose discussions involve little 

(3.2, 3.3) and Snedecor theory, see Kendall’s book 3.5) 

ematics. For presentation of Unfortunately, no single 

and papers by Fisher ^^’^^^nprehensive accoimt of the subject. 

reference contams a really oompieiiui 

3.3 Accuracy in Computations 
3.31 Original Records. « 

rr maintaining a high standard of 

“£stplace.Kereerrms^^^^^^^^ 

should be made m a ^Ica y g checking each reading immediately 
readings are being taken, the _ „bsevvfil,ions are 

after it has been written dow^j^^^XrsLiM be examined and the 

in tW. w;ay. H « 

reason sought. Gross explanation is found, a note to 

checking such an observatio ^ situations where 

this effect should be mad^ ^i-fpprent nerson. Most statisticians who 

the later analysis is done encountered the perplexing 

analyze other people P completely out of line that a gross 

— :o^Uin no oon.„«>. aW 
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the observation, and the recorder’s memory of the circumstances in 
which it was taken has faded. ^ ^ ^ ^ ^ 

Copying from the original records to a form suitable for analysis may 
also introduce errors. Since cross-checking is rather a soporific task, mis- 
takes tend to be overlooked even when two persons are used for the 
check. Sometimes foresight can eliminate the need for copying. 

3.32 Checks in the Analysis of Variance. The analysis of variance it- 
self consists of two sorts of operations. First, a number of totals are 
formed. These are nearly all self-checking, for the fact that the treat- 
ment totals and the block totals both add to the grand total may be ac- 
cepted as proof of their correctrxess. This is not a watertight check, since 
compensating errors in two treatment totals pass undetected. Also it 
does not ensure that treatments and blocks are correctly identified, and 
cases have occurred where treatment totals were mistakenly labelled as 
block totals, and vice versa. But apart from such rare events the sum 
check is as satisfactory as any. 

The second stage, the computation of the various sums of squares, is 
not self-checking and must be checked by recomputation. The most ac- 
curate metliod is to have two computers carry out the analysis inde- 
pendently, comparing results only when both have finished the task. 
The practice of having one computer *^check’’ the results of the other, 
though more convenient, alwayvS seems to produce a certain number of 
errors. This is to be expected, because, when the first calculations are 
nearly always correct, as they should be, it is diflficult for the checker to 
avoid the presupposition that they are correct, with resulting mental 
laziness. Recomputation by the same person, though often unavoidable, 
is probably less accurate than the two preceding methods. 

An expert who mshes to do so can make the analysis self-checking. 
For instance, the error s,s. can be computed independently of the other 
sums of squares, sometimes quite easily and sometimes with more diffi- 
culty. If this sum of squares agrees with that found by subtraction, all 
the items in the “sums of squares’’ column may be regarded as correct, 
apart possibly from errors in labelling. However, with the computa- 
tional methods that we present, checking is required except in, a few cases 
specifically noted. 

The steps in the preparation of sumrnaiy tables (e.g., calculation of 
treatment means from totals) are usually partly but not entirely self- 
checking. For example, the fact that treatment means average to the 
general mean foims a check. Care should be taken to recompute those 
steps that are not self-checked. 


54 NOTES ON THE STATISTICAL ANALYSIS OF THE EESULTS 3.32 

With the more complex designs, where treatment means have to be 
adjusted in order to eliminate the environmental effects, errors have 
arisen because the computing instructions were misunderstood. For 
instance, the adjustments made wwe ten times as large as they should 
have been. A person thoroughly familiar with the design would have 
known that the adjustments were ridiculously large, but someone new 
to the design may have no idea what size of adjustment it is reasonable 
to expect. This danger with complex designs should not be ignored. It 
can be minimized by careful study of the method of analysis for any 
design that it is proposed to use. 

3*33 Nmnber of Figures to be Retained, Since the primary purpose 
of the analysis of variance is to obtain an accurate value for the error 
m.s., the number of figures which it is worth retaining during the cal- 
culations depends on the size of this figure. This is not knovm in ad- 
vance! but often a rough idea of the coefficient of variation (ratio of the 
standard error to the mean of the experiment) is available from previous 
experience. For the original data from which the analysis of ^uiriance is 
computed, a crude rule which errs on the safe side is as follows. Record 
the original data to 4 significant figures if the coefficient of variation is 
between 0.4 and 4%, to 3 if it is between 4 and 40%, and to 2 if it exceeds 
40%, Let us apply this mle to the eelworm data. Since experiments on 
eelworms are not very common, there is not much basis on whicli to 
predict the coefficient of variation. Suppose that we guess that it is un- 
likely to be below 20% (it actually turned out to be 40%). Then, for 
safety, data should be recorded to 3 significant figures. Since the average 
eelworm number per plot is around 300, this means that data are re- 
corded to the nearest eelworm, which is as accurately as they could be 
recorded. 

A more precise rule, requiring a little calculation, is that the rounding 
interval should not exceed one-quarter of the standard error per observa- 
tion. To apply this rule to the eelworm data, starting with the same 
premise that the coefficient of variation is 20%, we note that the mean 
per plot in the experiment is about 300. Thus the predicted value of the 
standard error per observation is about fiO, so that the rouiiding interval 
should not exceed 15. It will be quite satisfactory to roiunl each ob- 
servation to the nearest 10 eelworms. The discrepancy from the result 
given by the previous rule illustrates the fact that the first rule tends to 
be too conservative. 

The advisability of deliberately rounding records already taken should 
be left to the judgment of the experimenter. With modern calculating 
machines it may be more expeditious to carry out the computations with 
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a few imnecessary figures than to take time to perform and check the 
rounding. The rules are most likely to be useful where computing de- 
vices are poor or imavailable. 

In the analysis of variance itself » it is generally as well to carry the full 
number of figures obtained from the uncorrected sum of squares; e.g., if 
the original data contain one decimal place^ the sum of squares and hence 
the analysis of variance will contain two decimal places. This number 
will iisiiaily be excessive, but no more time is likely to be wasted in 
writing down the unnecessary digits than in estimating how many digits 
should be retained. 

In the final presentation, of results, on the other hand, vsuperfluous 
digits should be strictly avoided. Tiie^t make the conclusions more 
difficult to grasp, impede rapid mental comparisons, and give some 
readers an erroneous impression of the accuracy of the results. A treat- 
ment mean should be rounded to one-tenth of its estimated standard 
error; that is, if the estimated standard error of the treatment means is 
2.56, the means could be rounded to Q.256 and should be rounded to 0.1, 
i.e., to one decimal placej since this is the nearest ixjunding interval that 
is convenient. In some experiments (e.g., those of the split-plot type) 
difierent treatment comparisons have different standard errors. Here 
the rounding interval should be decided from those comparisons that 
have the smallest error. 

3,34 Identification of Data, The practice Kshould be followed of label- 
ling all data, from original records to summary tables, so that another 
person at a later time can tell what the data mean. Much data, ex- 
pensive to collect and potentially valuable for some research, have had 
to be discarded or destroyed because the original collector cannot be 
reached, and new investigators are unable to discover what measure- 
ments were taken and under what circumstances. Sometimes the units 
in which the observations or summary tables are presented cannot be 
found, or the crop on which a field experiment was carried out, or the 
nature of the experimental treatments. Frecpiently, data serve their 
most fruitful purpose when some later investigator gathers from many 
places «all the material bearing on some question, and it should be a habit 
to facilitate ratliei* than hinder sucli research. 

3.4 Subdivision of the Sum of Squares for Treatments 

3.41 Reasons for Subdivision. Often an experiment is planned to 
provide the answers to a number of different questions, not necessarily 
connected with one another. The eel worm experiment is a partial 
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though not an ideal example. One way in which the questiims asked 
might be framed is as follows, (i) Is soil fumigation effective? Here we 
are interested in the average reduction in eelworm numbers taken over 
all fumigants, (ii) Are there differences in the effectiveness of different 
fumigants? For this question^ comparisons of the reductions due to dif- 
ferent fumigants are relevant, (iii) What is the relative effectiveness of 
single and double doses? Since the simplest reasonable hypothesis is 
that the reduction in eelworm numbers to the double dose is twice that 
to the single dose, we might narrow this question by asking: Is the aver- 
age response to fumigation proportional to the amount of dressing, or in 
other words is the average response curve linear? (iv) Finally, if there is 
some indication of curvature in the responses, we might ask whether the 
amount of curvature is the same for all fumigants. 

With experiments of this type the F-test of the complete treatments 
m.s. is not particularly helpful because it is directed, as it w-ere, at a 
mixture of several diverse questions. By an extension of the analysis of 
variance, we can subdivide the treatments s.s. into a number of com- 
ponents that are more relevant to the individual questions. Moreover, 
an F-test can be made on the mean square for each component. The 
rules for subdivision are given in succeeding sections. Effective use of 
the device requires long practice applied to a considerable number of ex- 
periments. 

3.42 Subdivision into Singie Components. The simplest (*ase, where 
all treatments have the same number r of replications, is considered first. 
The calculations are best made from the treatment totals Examples 
of the sort of quantity that we are interested in studying are 

/ ^ : Ti-{-T2 

T, ^ T2] — ^ Ta; ^ - {T^ + T,) 

Quantities of this type are called Imear functions of the T^. Note that 
the sum of the coefiicients of the is always zero, as it must be if the 
quantity is to represent a comparison among the !^^s. These ideas may 
be formalized as follows. 

Definition, Any linear function 

Zw = IwiTi 4” * “h l^jfTk 

is called a comparison among the if 
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Buie 1. If Zy, is any comparison among the T’s, the quantity 

, where Z)„ = + l,o^ + • - • + 

is a component of tlie sum of squares for treatments and represents 1 di . 

This rule gives the divisor Au to be used so that the comparison 
may be included as a part of the treatments s.s. in the analysis of vari- 
ance, Incidentally j the rule implies that cannot exceed the treat- 

ments s.s. If it is found to do so, this is a sure sign of a mistake in com- 
Xxiitation. 

Definition. Two comparisons and Z 2 are said to be orthogonal if 
hihi + h2h2 + • • • + likhk “ 0 

This ^^sum of products^^ relation is very important. By theory, it 
ensures that .Si and are distributed independently of each other. Its 
relevance to the partition of the treatments s.s. appears in rule 2. 

Rule If and Z 2 nre orthogonal, Him Z 2 ^/D 2 is a component of 

rn 

i reatments s.s. 

Di 

This means that, if we divide the treatments s.s. into the contribution 
from a comparison and the remainder, and now wish to subdivide the 
remainder, we must choose comparisons that are orthogonal to zi. Sim- 
ilarly, after removing the contribution of the next comparison must 
be orthogonal to both and and so on. This leads to the next mile. 

Rnle 8. If the comparisons Z 2 , * • zj^^i are mutually orthogonal, 
i.e., eveiy pair is orthogonal, then 

■ O' O 0 ■ . ■ ■■ . 

Zi Z2^ Zk-l" 

Treatments s.s. = 1 1 1 

Di D 2 Dk^i 

This algebraic identity partitions the treatments s.s. [which has (k — 1) 
degrees of freedom] into {k — 1) components, each representing a single 
degree of freedom. 

Starting with a specified we can always find Z 2 i - • * , so as to 
construct a complete orthogonal set. In fact, it may be proved that there 
is considerable freedom of choice in this process. When all but the final 
Zh^i have been selected, there is only one possible choice for but at 
any previous stage many can be found that are orthogonal to all pre- 
ceding The experimenter must choose the z'b that are most relevant 
for purposes of interpretation. 
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Table 3.4 shows the single components most frequently used for inter- 
pretative purposes when there are three or four treatments. It is as- 

TABLE 3.4 Sample sets op single components 
Three treatments 

i. Equally spaced increments of one ii. Two qualities of an ingredient and 
ingredient a control 






^1 

^2 

Ti 

--i 

1 

(0) 

■ -2' ■ 

0 

T2 

0 

—2 

T 2 (ai) 

1 

-. 1 . 

T, 

1 

1 

Tq (ao) 

1 

1 

Component 

Lin. 

Quad. 


Effect 

Quality 

divisor 7) 

2r^ 

6r 

Component 

of a 

diff. 




divisor D 

6r 

2r 



Four treatments 




i. Equally spaced increments of one 

ii. Two comparable types of ingredient, 


ingredient 


a and7), and two qualities of each type 



S!2 

^3 


Zi 22 

^3 

Tt 


+ 1 


Ti (ai) 

4-1 4-1 

0 

3’2 


-1 

+3 

Ti (as) 

4-1 -1 

0 

Tb 

-fl 

-1 

-3 

T-i (bi) 

-1 0 

4"1 

Ta, 

“bS 

+ 1 

+ 1 

Ti (hi) 

~1 0 


Component 

Lin. 

Quad. 

Cubic 


Quality 

Quality 

D 

20r 

4r 

20r 


diff. 

diff. 





Component a vs. h witliin a 

within h 





D 

4r 2r 

2r 

iii. 

Different levels of each of two different ingredients a and h 






22 

^3 



Ti 

(aibi) 

-1 

-1 

-hi 



Ti 







Ts 

(aiba) 


+ 1 




Ta 

(flih) 

4"! 

-4-1 

4-1 





Average 

A^mrago 






response 

response 




Component 

to a 

to 

Interaction 




D 

4f 

4r 

. ■ 4r ■ 



r* « number of replicates, assumed the same for all treatments. The coefficients 
given above are not valid if r varies from treatment to treatment. 


sumed that all treatments have the same number of replicates. The 
divisors D are those required for inserting the square of g in the analysis 
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oi variance. The reader should verify that these divisors satisfy rule 1, 
and that each set of forms a complete orthogonal set. 

With three treatments the two sets presented are essentially the same, 
though the meaning attached to each z is different. Case i applies in an 
experiment where we have, say, zero, single, and double dressings of a 
fumigant, and wish to isolate the linear and quadratic components of the 
response curve. The response found by fitting a straight line to the three 
levels is represented by while Z 2 measures the deviation from a linear 
response. Case ii arises when we have comparable dressings of two dif- 
ferent fumigants, and wish to examine the average effect of fumigation 
(^i) and the difference between the fumigants ( 2 : 2 ). 

With four treatments, case i shows zero, single, double, and triple ap- 
plications of the same ingredient; here the response curve can be divided 
into its linear, quadratic, and cubic components. Alternatively, in case 
ii, we might be testing the wearing qualities of two types of ‘T00% wooF^ 
suits and two types of suits made of w(3ol-rayon mixtures. Case iii is 
used, for instance, when we examine the effects of two different amounts 
of sugar and two different amounts of vanilla in the preparation of a cake. 
Experiments of this type are discussed more fully in chapter 5. 

When different treatments have differing numbers of replicates, the 
I'ules are changed slightly. If the ith treatment has replicates, we have 
shown that the treatments s.s. is 


The changes to be noted are: 

i. A linear function 

= IwlTi + IW2T2 H h livhTh 

is a comparison among the treatment totals Ti if 

ii. The divisor required for is 

Dll} ^ 7^ily)X ~t" ^^2^' w2 “f" * * * “b 

iii. Two comparisons zi and Z 2 are orthogonal if 

+ ’^" 2 ^ 32^2 T/chkhk 0 

3,43 Incomplete Subdivisions, The subdivision need not be com- 
plete in the sense that it is composed entirely of single components. In 
the eelworm experiment we suggested that the treatments s.s., with 8 


T.2 


m 2 

j'l rg 


n (Ln) 
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components, might be divided into the following parts: (i) a single com- 
ponent representing the average i^esponse to fumigation; (ii) a single 
component representing the deviation of the average response from lin- 
earity; (hi) a part representing differences in the responses to the dif- 
ferent fumigants. Since there are 4 fumigants, this part comprizes 3 
components; (iv) a part representing differences in the curvature of the 
response curves for different fumigants: this also has 3 components. 
Some rules for partial subdivision will be given and applied to this ex- 
ample. The most general rule is as follows. 

Rule 4 . A set of g quantities Qi are independent components of the 
treatments s.s.j with rii degrees of freedom respectively, if 


+ ••*+■ 


(i = 1,2, 


where all Zij are mutually orthogonal comparisons, and the Dij are the 
appropriate divisors. Further, the remaiiider 

Treatments s.s. — ^ Qi 

is an independent component of the treatments s.s. with 
[(A — 1) — 2^ Ui] degrees of freedom 

This rule, which is a deduction from previous rules, is not very helpful 
to the beginner. With experience it often becomes easy, without carry- 
ing out the details, to see whether a set of Qi can be expressed in this 
way. Some more specialized rules follow. 

Rule 4a. If Ti, ^ 2 ? ' * *? I'p are the totals for any set of treatments, 
then ■ ■ ■■ ■ 

+ . . .+ 

. .y ,= — ^ -I. , v , ■ ■ 

n ’’2 Tp ^ n 

is a component of the treatments s.s., with (p — 1) degrees of freedom. 

This rule states that the sum of squares of deviations among sub- 
group of the treatments, with proper divisors, is part of the treatments 
:s.s.'," 

Rule 4b. Sometimes the treatments (or part of them) can be divided 
into a number of groups. The number of treatments in a group need not 
be constant, and different treatments may have different amounts of 
replication. Let Tij be the total for the jth treatment in the fth group, 
aud let 

Si = Til + Ti2 -f ■ . • 3\pj,‘ 

Ri — rn rf2 + • • ■ + Tip. 


(f- 1, 
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Then let 


Hr. 


(Si +... + Sg)2 


Ri 4" ■ • • “f- i2g 

In this case Qi represents the sum of squares of deviations among the 
treatments in the ith group, with (pf — 1) degrees of freedom, while 
Qg+i is the >sum of squares of deviations among the group totals with 
(g — 1) degmes of freedom. These Q^s are independent components of 
the treatments s.s. 

Rule This applies when all treatments have the same number r of 
replicates, and every giTmp contains 2? treatments. Let 

W'here the set of Ts remains the same in all groups. Then 

. . 

is a component of the treatments s.s., with (g — 1) degrees of freedom. 
This rule is useful wiien we wish to compare linear functions of the treat- 
ments witliin eacth group. Note that the divisor in Q is the same as the 
divisor that would be used for a single 

Applwalicms. In the eelworm experiment, the average effects of fumi- 
gation are obtained from a comparison of the total for the control with 
the totals for the single and double levels of fumigation. Consequently, 
^ve might regard the 9 treatments as divided into 3 groups: the control, 
the single levels, and the double levels. By rule 45, this gives the follow- 
ing partition of the treatments s.s. (the treatment totals are found in 
table 3.2, p. 50)* 

Between fiiinigants (single level): 

(1066)2 + (928)2 + (1431)2 + (892)2 (4317)2 

_■ ::=:;.45,461 V 

4 16 

Between fumigants (dnublf.^ level): 

(1205)“ + (877)2 + (1241)2 + (1122)- (4505)2 


Between levels: 
<2 d.{.) 


(6858)2 4. (4317)2 ^ (4505)“ (14,680)“ 


The three sums of squares add to 157,449, in agreement with the treat- 
ments S.S., 157,448, as given in table 3.3, p. 51. 



02 NOTES ON THE STATISTICAL ANALA^SIS OF THE RESULTS 3.43 

The next step is to divide the sum of squares between into a 

component representing the average response to fumigation and one 
representing the deviation from a linear response. By least squares 
theory, the former is given by a comparison of the doulde level with ihe 
Kero level That is, we may take 

^ 4505 - 5858 - -1353 

where the subscript refers to the level The deviation fi’om linearity is 
HK^asiired by the orthogonal comparison 

Z 2 - (82 ““ Si) - (Si — aSo) = S2 + Sq — 2 Si 
4505 + 5858 - 2(4317) = 1729 

Since each S is a total ovei’ Id plots, the divisors are 32 and 96 re- 
spectively. Hence ihe coiitributions to the sum of squares are: 


Average linear response: 


. ■■ 32 ; 

(1729)^ 

Average curvature: = 31,140 

9G 

The subdivision found thus far might be presented as shown below. 


31,140 


Between treatments 

d.f. 

8 

s.s. 

157,449 

m.s. 

19,681 

Average linear rf^sj^onse 

1 

57,207 

57,207 

Average curvature 

1 

31,140 

31,140 

Between single levels of fumigaxits 

3 

45,461 

15,154 

B<.?tween double bivels of fumigants 

3 

23,641 

7,880 


This is not the subdivision envisaged at the beginning of this section. 
Instead, it was proposed to divide the final 6 d.f. into 3 representing dif- 
ferences in the responses to individual fumigants and 3 representing dif- 
ferences in the curvatures of the individual response curves. This 
separation is more difficult. 

Consider a comparison of (CN) and (CS). Estimates of the difference 
between these fumigants are available both, at the single (ICN) and the 
double (2CN) level If the effects of both fumigants are proportional 
to the amounts of dressing, the true difference at the double level will 
be twice that at the single level On this assumption, two independent 
estimates of the difference between (CN) and (CS), for a single dressing, 
are 

, V ' - ^ [(ICN) - (ICS)] and ~ 
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The second estimate is considerably more accurate than the first, since 
its variance is only ]4 as large. Statistical theory shows that such 
estimates are combined by weighting each inveraely as its variance. 

Hence, on the assumption of linearity, the most accurate estimate of 
the dilference per unit dressing is 

[(ICN) - (ICS)] 4- (4)(|)[(2CN) - (2CS) ] 

1 -f 4 , : 

[(ICN) - (]CS)] + 2[(2CN) - (2CS)I 

_ ^ — 

5- 

This shows that differences in the linear respijnses to the 4 fumigants 
are measured by comparisons of the quantities (ICN) + 2(2CN), etc. 
These quantities are given below. 

(Single level) + 2(double level) 

(CK) (CS) (C^I) (CK) Total 

359(> 2682 3913 3136 13,327 

It is for quantities of this kind that rule 4c is useful. This rule shows that 
the sum of s(|uai*es of deviations of these quantities, when divided by 
("0 [(1)^ + (2)^]? 20, is a component of the treatments s.s. 

This gives 43,408, repi^esenting differences in linear responses to the 
fiimigants. ■ ■ 

We now consider the final three components. The curvature of an 
individual re*sponse curve is measured by a comparison of the type 

[(2CN) - (ICN)] -- [(ICN) -- (0)] - [(2CN) + (0) - 2(1CN)] 

where (0) represents the total number of eelworms over 4 plots for the 
control treatment. When the difference between two fumigants is taken, 
the (0) term disappears and we obtain the comparison 

[(2CN) -- 2(1CN)] - [(2CS) -- 2(1CS)] (3.19) 

It is easy to verify that this is orthogonal with comparisons like (3.18). 
Consequently differences in curvature are compared by means of the 
quantities shown below. 

2 (Single level) — (double level) 

(GN) (CS) (CM) (CK) Total 

867 979 1021 662 4129 

Hide 4c again applies. The divisor is 20, and the sum of squares con- 
tributes 25,693. The final separation is given in table 3.5. It is evident 
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from f 4tvstB tluxt none of the mean squares is significant at the 5% level, 
ami only that for the average linear response approaches near to the 
5 % level. Th(?se results must not be taken as final conclusions for this 
(‘xpxM’inient. As will bo seen in section 3.8, additional data weie lecoided 
in this oxj)eriinent whicli permit a more accurate analysis. 

l-WJUiE 3.0 BiiiiDn’isioN of the treatments s.s. for the eelworm data 


Hciarcv of variiifi{»n 

d.f. 

s.s. 

m.s. 

Avorago linear 3’L*,sp(mse 

1 

57,207 

57,207 

Averag<.r eiu'vaiimv 

i 

31,140 

31,140 

Differen(M‘s in linear rt^Hpoiise 

3 

43,408 

14,469 

Diffcirei ices in curvature 

■ '3 

25,693 

8,564 

Error 

36 

544,690 

15,130 


A more extensive discussion of this general topic, with a number of 
examples, is given in Snedecor's book (3.4), chapter 15. It is worth re- 
peating tiiat the amount and type of subdivision that should be done de- 
pend on the experiment, and to some extent on individual taste. As will 
be seen in the next section, any single component can be tested by means 
of a /-test derived from the treatment means, and some workers prefer to 
make, the test in this way . 

3J; Calculation of Standard Errors for Comparisons among 
Treatment Means - 

3J1 Rules. We first give a rule (rule 5) that is more general than 
we iK‘nd, because it is sometimes useful for other purposes. 

Ruin . 7 . The standard error of any linear function 

Z ^ liyi + Z22/2 H + InVn 

of the individual observations is 

CTjf — (T 's/l^ 4" 4“ . . . -j- 

The edimaiod standard error is obtained by substituting 5 for a, where b 
is the square root of the error m.s. in the analysis of variance. This rule 
supplies the standard error of any kind of linear function, provided that 
it has been expressed in terms of the individual ohservations. For linear 
functions of the ireatmmt nieariB^ one of two special cases of the rule is 

Rule 5a, If all treatments have the same number r of replications, the 
standard error of a linear function 
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Rule Sb, If the ith treatment 1ms n replications* the standard error of 
z is 





+ 


II 

n 


As in the general casej we substitute s for (r in order to obtain the esti- 
mated standard error. Finall^^v a rule that may sa,ve labor in complex 
cases is as folUws. 

Rule 6, If the linear functions Si, 02 ? ' • • j are mutually orthogonal, 
the standard error of 


Z = + I2Z2 -f* * • ‘ + Ip^p 

is 

CTjj, = . . . -p Ip^CTp^ 

wdiere af is the variance of Zi, 

This rule enables the work to be done in two stages. Sometimes, in a 
large experiment, it is relatively eas 3 ^ to express the function z in terms 
of a number of familiar orthogonal functions, l^ut rather tedious to wiite 
z ill terms of original observations. The variances of tlie Zi may be found 
by rule 5 and substituted in rule 6. Note that ditferent Zi may involve 
the same set of observations, provided that the Zi are orthogonal 


3.62 Examples. These rules will now be illustrated by application to 
the eelworm experiment. The treatment means are shovm in table 3.6, 
the figures in parentheses denoting the order in which the treatments are 
numbered. 


TABU] 3.6 Mean numbers op ekuworms per plot 
Fumigant 


Level 

(ON) (CS) 

(CM) 

(CK) 

Means 

0 

366(9) 



366 

1 

266(5) 232(6), 

358(7) 

223(8) 

270 

2 

316(1) 219(2) 

310(3) 

280(4) 

281 


From table 3.5 the standard error per plot is '\/(15,130), or 5 — 123, 
with 36 di. 
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Emniple 1. Standard error of the average linear response. This is 
iiieasurc3d by the average response to the double dressing, i.e., 

z ^ |■(?7l + + ]h + yd #9 ~ *^^5 

Since the first 4 means are based on 4 replicates, while 2/9 bas 16 repli- 
cates, we apply rule 55, which gives 


s. ~ ( i 23) / f rr 7 

V \16/\4/ \16/\4 


^)GM^ 


4\ / 1 \/l\ / 1 

4/ \16A4/ \16 


The value of Student^s i is 85/43.5, or 1.Q5. 

As mentioned previously, the I-test of any comparison, as made above, 
is identical with the F-test of the corresponding component in the 
analysis of variance. The basic relationship is F = 4^; that is, if the 5% 
values of t for n degrees of freedom are read from the table and squared, 
their squares are the 5% F values for 1 and n degrees of freedom, and 
similarly for any other significance level This result may be verified in 
the present instance. The value of f is 3,80, with 36 di. From the 
analysis of variance in table 3.5, the F value for the average linear re- 
sponse is 57,207/15,130 or 3.78, with 1 and 36 d.f., the two values 
agreeing apart from rounding errors. The reader may check that the 
same agreement holds for the average curvature. 

Exmnph $. Response to the single dressing of (CN). The response is 
measured by 

^ = 2/5 yo 


OrK?e again rule 56 is appropriate. 


1 1 

— f- ■ — 
4 16 


123 V5 


= 68,8 


Example 3. Differetice between the linear responses to (CN) and (CS). 
In section 3.43 it was shown that this difference is 

Ivs ~ §6 + 2yi - 2^^] 


Since all means have 4 replicates, rule 5a may be used. 

■ ' * 777 ^^ vT+i + 4 + 4 = (12.3)(Vio ) = 38.9 
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Example 4- Although the experiment is not complex enough to ex- 
hibit a profitable application of rule 6, this example shows that the rule 
agrees with the other rules. Suppose that at the double level we had 
compared (CN) with (CS), and also the two chemicals with the two 
proprietary mixtures. The comparisons are 

% = i /1 - #2; 22 

By rule 5a their standard errors are 

2 

and it will be noted that the two functions are orthogonal. 

If now we wished to compare the double dressing of (CN) with the 
mean of tlie 3 other double dressings, the comparison would be 

. h+p3 + ^j4 

Z = y^~- 


?7i + h _ ga + Vi 
2 2 


a 



By a direct use of rule 5a, we have 



1 1 1 
1 + - + - + - = 
9 9 9 


VS 


But alternatively we may write 

2zi 2 z2 

^ T'^T 

and apply rule 9 to give 



in agreement with the more direct method. 

3.63 Testing Effects Suggested by the Data. In order that F- and t- 
tests be valid, the tests to be made in an experiment should be chosen 
before the results have been inspected. The reason for this is not hard 
to .see. If tests are selected after inspection of the data, there is a natural 
tendency to select comparisons that appear to give large differences. 
Now large apparent differences may arise either because there are large 
real effects, or because of a fortuitous combination of the experimental 
errors. Consequently, in so far as differences are selected just because 
they seem to be large, it is likely that an undue proportion of the eases 
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select od will be those where the errors have combined to make the dif- 
ferences large. The extreme case most commonly cited is that of the ex- 
pejiinenter who always tests, by an ordinary f-test, the difference be- 
tween the higiiest and the lowest treatment means. If the number of 
treatments is largo, this difference will be substantial even when the 
treal monl s produce no real differences in effect. It may be showm that 
with 1 menl,s the observed value of t will exceed the 5% level in the 
table about 13% of the time. With 6 treatments the figure is 40%, with 
10 tr(,>atmcuts 00%, and with 20 treatments 90%. WiOien the expeii- 
mcmtor thinks that be i.s making a i-test at the 5% level, he is actually 
testing at tJio 13% level, or the 40% level, and so on. To summarize, 
the selection of those differences that look large and therefore “in- 
terc.stiug” invalidates the ordinary tests of significance. The effect is to 
obtain too many significant results, or to raise the significance level of 
the test from the presumed 5% to some higher level, usually unknown, 
On the other hand, the rule that no tests must be constructed after 
seeing the data seems contrary to sound scientific practice. Often the 
initial experiments in a line of research are conducted in order to “see 
what happens”; the research worker does not know in advance what 
comparisons he may wish to test. And even wdiere hypotheses about 
the nature of the results can be set \ip, experiments often indicate 
strongly, as is later confirmed, that these hypotheses are erroneous and 
that the method of analysis based on them is inappropriate. Con- 
sequently, although the effects of selection cannot be ignored, they should 
not deter the experimenter from the most careful examination of his 
data. Any difference that is of interest, whether anticipated or not, 
should be te,stcd. If the ordinary tests show that the difference is not 
signifi(utnt, then at least we know that the difference can be accounted 
for without supposing any real effect. If an unexpected difference is 
.statistically significant, the possibility of a selection effect should be 
borne in mind. When the point at issue is important, the best pro- 
cedure is to conduct a new experiment specifically designed to confirm 
or disprove the indications from the previous results. 

The reporting of such effects presents difficulties. The following is a 
suggestion. “The experiment also indicated an imexpected effect of 
• • • . Although this was statistically significant at the 5% level, it is 
contrary to previous experience, and must at present be regarded with 
caution. Further work on this point will be undertaken.” The diffi- 
culties with this kind of writing are that we do not wish to imply that 
the effect is definitely established, and yet the writing should not en- 
courage the reader to ignore any result that is contrary to our precon- 
ceived notions. 
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3.6 Subdivision of the Sum of Squares for Error 

3.61 Reasons for Subdivision. The sum of squares for error can be 
partitioned into components in the same way as that for treatments. 
Although such subdivisions are not so frequently required as with treat- 
ments, they have a number of uses. Sometimes there is reason to believe 
that the eiTor s.s. is not homogeneous; that is, the residual errors e do 
not all have the same variance o-^ as postulated in the mathematical 
model. In this case, as will be showm in section 3.63, subdivisions of the 
error may be necessary in order to obtain valid Wests. Occasionally a 
gross error in some observation may be suspected, and it is helpful to 
calculate the contribution of this observation to the error s.s. In ad- 
dition, a subdivision may help in understanding the nature of the error 
m.s. 

3.62 Rules for Subdivision. These follow the same general pattern 
as with treatments, though they are a trifle more complicated. It is 
necessary to go back to the normal equations (3.4) from which the treat- 
ment and block effects were estimated. These equations took the form 

23 iVijk - m -k- hj) = 0 (3.4) 

In other ^vordSy the residuals that provide the estimates of error must 
add to zero over any treatment or over any block. Consequently^-, if the 
linear function 

is to be a component of the error, the coefficients must sum to 5iero 
over all observations that receive any specified treatment and also over 
all that are in any specified block. These are the tests by \Yhieh we tell 
whether any proposed 0 is part of the error. 

For any z which satisfies these tests, the contribution to the error s.s. 
is where by rule 5 

D = 

We now examine some components of the error s.s. in the eelworm ex- 
periment. Obviously, the. difference z l)etween the eelworm numbers on 
any two control plots in the same block satisfies the conditions. Sim- 
ilarly (my comparison among the 4 control observations in a block is a 
component of the error. It follows that the sum of scpiares of deviations 
of these 4 observations from their mean contributes 3 d.f. to the error 
s.s. The 4 blocks together contribute 12 d.f . of this type of error. The 
sum of squares will be found to be 307,312. This component is a meas- 
ure of the amount of variation among observations that receive the same 
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treatmerit and lie in tlie same bloekj and is the type of component that 
we should expect to constitute the error. However, this type of com- 
ponent is encountered only when some treatments have been replicated 
mthin th^ hloek. 

Consider now the difference between two other treatments, say (ICN) 
and (ICS). Let 

where the subscript I denotes that the two observations come from 
block L This function is not a component of the error, for, while the 
coefficients add to 2 ero over every block, they do not do so over treat- 
ments (lCi\r) and (ICSO- B^it if we take 

- (lCiV)i -- (lC.S)i -- (lCiV)2 + (ICS)2 

the conditions are satisfied. This expression shows how the difference 
between two treatments changes from one block to another. It is a 
measure of the effect of the blocks on the treatment difference, and IN 
usually called an interaction of treatments with blocks. Such inter- 
actions are the typical components of error. 

If Zi is the difference between (ICN) and (ICS) in the fth block, any 
comparison among the Zi satisfies the rules required for a component of 
the error. Hence the sum of squares of deviations of the Zi^ when divided 
by 2, contributes 3 d.f . to the error s.s, A similar result holds if we take 
any comparison among the treatments and calculate it separately for 
each block. Since the 9 treatments provide 8 independent comparisons, 
we see that the 24 clf. for the interactions of treatments with blocks may 
be divided into 8 sets of 3 di. The sum of squares for these 24 di, can 
of course be found by subtraction as shown in table 3.7. 


TABLE 3.7 Subdivision of the sum of squares for error 



il.f. 

s,s. 

m.s. 

Error 

36 

544,690 

15,130 

Among controls 

12 

307,312 

25,609 

Treatments X blocks 

24 

237,378 

9,891 


The F-ratio of the two components of error is 2.59, with 12 and 24 d.f. In 
the ordinary table this corresponds to a significance level of about 2J4%. 
It is probably more correct to make a two-tailed test by doubling this 
probability. Either test indicates that the controls are more variable 
than the other treatments. This is not very surprising, since in data of 
this type the variance may tend to increase as the mean increases. In 
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CALCULATION OF A SEPARATE ERROR 

any event the result throws doubt on/ the suitabH^ of the original 
mathematical model; it would seem better to ascribe different error 
variances to the control and the fumigants. 

The control also contributes to the treatments X blocks component 
of the error. The 8 comparisons among the 9 treatments can be divided 
into 1 between the control and all fumigants, and 7 which are com- 
parisons among the fumigants themselves. The former will contribute 
3 d.f. to the treatments X blocks interaction; the latter, 21 di. If the 
control is more variable than the fumigants, we would expect the mean 
square for the 3 d.f. to be larger than that for the 21 d.f., though it should 
be I'emembered that a mean square with only 3 di. is poorly determined. 
In the next section tliese two components will be computed separately. 

3.63 Calculation of a Separate Error for a Treatment Comparison. As 
the eelworm example illustrates, certain treatments may be erratic in 
their effects, while others in the same experiment show more stable re- 
suli.B. Wlum this occurs, com])arisons involving the erratic treatments 
have a higlier experimental error variance than those among the more 
stable treatments. Since the error m.s. in the analysis of variance is some 
weighted average of these different variances, its use for individual 
tests may not be valid. In such cases it is helpful to he able to calculate 
a separate error for any specific treatment comparison. The procedure 
Will 1)0 illustrated for the comparison between the control and the 
fumigants. 

The first step is to compute the comparison sepa.rately for each block. 




Totals in block 




1 

2 

3 

4 

Total 

Control 

1814 

1295 

1769 

980 

5858 

Fumigants 

2569 

1780 

2083 

1490 

8822 

2 (Control) — (Fumigants) 

1059 

810 

555 

470 

2894 


Since there are 4 control plots and 8 fumigated plots in a block, the con- 
trol total is .multiplied l>y 2 in forming the comparison. The contribu- 
tion to the error s.s. is 

(1059)“ + (810)2 ^ (555)2 + (470)2 (2894)2 _ 

--('V y24:;;: 

The divisor 24 is found by the usual rule. The mean square is 2954, 
with 3 d.f. By subtraction, the mean square for the interaction of fu- 
migants with blocks (21 d.f .) is found to be 10,882. Contrary to an- 
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ticipation, the component involving the control has the smaller mean 
square, though the difference does not approach significance. 

As often happens when the simple model is inadequate, the best 
approach for a more accurate analysis is questionable. It is possible 
to compute a separate error, by the method just given, for each if-test 
that is made. This is the procedure least open to criticism, since it 
makes no assumption that errors are constant from one treatment to 
another. Since, however, each ^-test would be based on only 3 d.f., 
these tests are msensitive, so that this method should not be used with- 
out good reason. In the present case it is probably justifiable to pos- 
tulate only two error variances. The first, appropriate to the controls, 
is estimated by the mean square 25,609, with 12 d.f.; the second, for the 
fumigants, by the mean square 10,882, with 21 d.f. By this approach 
the variance of the mean response to the double dressing, for example, 
is estimated as 

10,882 25,609 

' 16,. m 

since each mean is taken over 16 plots. 

Separation of the error s.s. into smgle components is seldom required 
except in special studies or as an exercise. One method is to set up 8 
orthogonal comparisons among the treatments. Calculate each com- 
parison separately in each block, and let Zij be the value obtained for the 
fth comparison (f — 1, 2, • * *, 8) in the jil\ block. Then find 

W^l == Zii ^ 

30^4 

Each WJ is a component of the error, and the whole 24 are mutually 
orthogonal. The squares of the w% each with its proper divisor, give a 
separation of the error s.s. into single components. 

3J Missing Data 

3.71 Method of Handling Missing Data in the Analysis. From time 
to time certain observations are missing, through failure to record, gross 
errors in recording, or accidents. The omissions naturally affect the 
method of analysis. With each of the common designs we give computa- 
tional instructions for analysing data that contain gaps. The object of 
this section is to indicate the theoretical basis for these methods. 

When certain observations are absent, the correct procedure is to 
write down a mathematical model /or all ohmrvations that are pTesent 
The least squares normal equations are then constructed in the usual 
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way. These take exactly the same general form as when all observa- 
tions are present; i.e., 

Z {y-ijh - m -ti -bj) = 0 (3.20) 

over all observations whose equations contain any specified parameter 
that is to be estimated. Since, however, the terms in the equation cor- 
respondiiig to missing observations are absent, the system of equations 
loses some of the symmetry that it possesses when all obseivations are 
present, and the solutions are more difficult. The same general procedure 
supplies F- and l-tests of hypotlie>ses about the nature of the treatment 
effects as described in section 3.24, though again the details become more 
coniplicated. 

In the analysis of variance, two changes may be noted. Owing to 
the missing observations, the treatments and blocks s.s. become en- 
tangled, so iliat the treatments s.s. must he computed after allowing 
for block effe(*.ts, as mentioned at the end of section 3.26. Secondly, 
if a ol.)servat ions are absent, the total number of degrees of freedom 
is reduced by a. Unless one or more complete treatments or blocks 
is missing, the number of parameters required to describe these effects 
will be tlie same as before. Consequently, the missing degrees of free- 
dom all ciome from the e.iTor s.s., Much now represents (iie a) degrees 
of freedom. In short, missing data may be handled by applying the 
standard least squares procedure to all observations that are not miss- 
ing. For future relereiicc, this method will be called the '^correct least 
squares proccdn!‘e/b 

To the experimenter it may be a difficult business to carry out the 
construction and solution of a set of unfamiliar normal equations, even 
though he is quite competent to analyze a set of complete data. For this 
reason Yates (3.9), following a suggestion by Fisher, considered inserting 
values for the missing observations so as to obtain a set of complete data, 
Suppose that only a single observation is missing, and that a value x is 
substituted for this observation. If the analysis of variance is calculated 
in the usual manner for complete data, the error s.s. is found to be of the 
form 

Ax^ 2Bx + C 

where A, B, and C are numbers that are determined by the type of de- 
sign and tlie values for the other obseiwations (J. is always positive). 

In order to find a numerical value for x, Yates proposed to use the 
value that minimizes the error s.s. This isx ^ B/ A. If this value is in- 
serted in place of the missing observation, and if the data are analyzed 
as if no observations were absent, Yates showed that several important 
properties hold, (i) The estimates of treatment and block effects are 
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exactly the same as those obtained by the correct least squares pro- 
cedure. (ii) The error s.s. is exactly the same as given by the correct 
procedure, (iii) To obtain the correct partition of the degrees of freedom, 
we subtract 1 from the total s.s, and 1 from the error s.s. 

Yates also showed that the method of insertion fails to agree wdth the 
correct least .squares procedure in two respects. The treatments s.s., as 
obtained in the analysis of variance of the ^ Complete” data, is always 
slightly larger than the correct treatments s.s. for an i^-test of the treat- 
ments. Unless an appreciable fraction of the total observations is miss- 
ing, tills overestimalion is unlikely to be large; further, the exact F-tevSt 
can he obtained by means of some additional calculations. The second 
defect of the method of insertion is that it may not give proper iJ-tests. 
That this will happen is clear, because in the analysis of “complete’' 
data r replications are ascribed to the treatment that contains the missing 
observation, whereas there are only (r — I) replications. To allow for 
this disturbance we give special rules which provide ^-tests that are ap- 
proximately correct. 

Thus the method adopted with a missing value is first to “estimate” 
this value by means of the formula B/A, which will be presented for the 
common designs. This estimate is used in place of the missing value, and 
the rest of the analysis is conducted as if the data were complete (except 
for the changes in degrees of freedom). Special methods are available 
for exact F-tests and for Mests. If several observations are aUsent, a 
repeated application of the formula enables values to be substituted for 
each missing observation. 

This method is essentially an ingenious computational device whose 
piu'pose is to enable the easy computations that applj^ to complete data 
to be used even when data are incomplete. Substitution of estimates 
for the missing data does not in any way recover the infomiation that is 
lost through loss of data, as some experimenters have suggested, usually 
facetiously; it merely attempts to reproduce the results obtained by an 
application of the least squares method to the data that are present. 
The only complete solution of the “missing data” problem is not to have 
them. 

3.8 The Analysis of Covariance 

3.81 Purpose of the Technique, As indicated in section 3.2, experi- 
ments can be planned so that certain types of environmental effect are 
eliminated from the estimates of the treatment effects, with the result 
that these estimates are made more accurate. In the eelworm experi- 
ment, where the plots were grouped into blocks of 12 plots each, any dif- 
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fereiices from block to block in the severity of eelworm infestation were 
eliminated in this way. On the other hand; differences in infestat^^^ 
from ^plot tojM wlihm Vm }^mns block are not controlled by the design and 
do contribute to tlie experimental errors, since treatments were assigned 
at random to plots within each block. Accordingly, before the fumigants 
were applied, samples were taken in order to estimate the natural in- 
festation cor each plot. 

The analysis of covariance shows how to use these supplementary data 
to reduce the experimental errors by eliminating the effects of variations 
in the initial infestation within a block. The technique is potentially 
very useful. It oiten happens that some source of variation which can- 
not be controlled by the design can be measured by taking additional 
observations. Whenever this is so, the anatysis of covariance can be 
utilized, often to great advantage. One caution is that, since the ad- 
ditional ul)SC3rvations are t o measure etiviroiimmtal effects, they must not 
1)0 influenced Iry the treatments. The situation where such measure- 
ments ure^ influonced l)y the treatments is discussed in section 3.88. 

3.82 Initial Steps in the Analysis. The first step is to construct a 
new mtithematical model. If refers to the final eelworm count and 
Xijk to the initial, the relation is 

fiijt — M + Ti + ih + if') + c/jfc (3.21) 

fl’he only idiangt^ is tlie ini roduct ion of a new term to describe the effect 
of tlit‘ initial eelworm number. We have assumed that the effect is 
linear, i.e., it is a eoiistant multiple y of the amount by which the initial 
eehvonn number Xijk on the plot differs from the average initial number 
■X for the ^vholc experiment. 

As before, the unlaiowiis are estimated by least squares. In this 
case we minimize 

I] llUjk - m - ti - hj - cix^jk - a')]“ (3.22) 

The normal equation for m is again of the form 

Y hjijk - m - t! - hj - c{xijk - «)] = 0 (3.23) 

over all oteoi'vatious. 1 f the same linear restrictions as before are ap- 
plied in the /,• ami the ft,, equation (3.23) implies that m is the mean of the 
yijk. The equation for U is the same as (3.23) except that the sum is over 
those obsei-vations that receive U The equation may be rearranged as 



76 N0T15S ON THE STATISTICAL ANALYSIS OP THE RESULTS 3.83 

where n is the number of replications, and T denotes a treatment total. 
This gives 


tfz *4^ t* 


Tiy 

n 




(3.25) 


The important feature of this equation is that in order to obtain ti, the 
observed treatment mean of the y’s (.Tiy/vi) is adjusted, the adjustment 
depending on the treatment mean of the a:’s. It is this adjustment that 

removes the effect of the initial infestation. 

The equation for c, the adjustment factor, will not be developed in 
full It turns out that c is given by the ratio of the error smn of products 
of y and a: to the error S.S. of 

3.83 Computations. In practice we start with a joint analysis of the 
sums of squares and products of y and z. In order to illustrate certain 
features of the covariance technique, the treatments s.s. rvill be sub- 
divided into “linear” and “curvature” components, as wms done for y 
in table 3.5 (p. 64). The original values for x are given above the y 
values in table 3.1 (p. 40). The treatment totals and the quantities 

TABLE 3.8 Tbbatment totals fob nltmbeks op cysts * 



Before fumigation (x) 

After fumigation (?/) 

Level 1 

ON 

as 

CM 

CK 

Total 

CN 

cs 

CM 

CK 

Total 

0 


1975 


1975 ' 


5868 


5858 

1 

: 402 

417 

513 

570 

1002 : 

1066 

928 

1431 

892 

4317 

2 

^ 389 

554 

568 

778 

2289 , 

1265 

877 

1241 

1.122 

4505 

2(2) +(1):L 

1180 

1525 

1649 

2126 

6480 

3596 

2682 

3913 

3136 

13327 

2(1) -(2) :a 

415 

280 

458 

363 

1615 

! 867 

1 

979 

1621 

062 

4129 


* Numbers of cysts per plot were the numbers found in 400 grams of soil. Tbu.s 
; the totftb in the table represent 1600 grams of soil, except for the no-fumigant totals, 

I which represent 6400 grams. 

! 

j needed for isolating the linear response and the curvature are reproduced 

I • , in table 3.8. The analysis of the sum of squares of x follows by the 
I same methods as given previously for y. 

I . To analyze the sum of products, we carry out the same operations as 

f , for a sum of squares, except that at every stage a square is replaced by 

I ■ . j ' V the corresponding product. A few examples will suffice. 
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77 


3.S3 


Totiil: (2li!))(4(i0) + (2S3)(2Sn) + • . . + (4S)(298) - = 366,929 


^ (lOrojloSoS) , (-102)(1000) , 

Treatments: -— 7 -; 1* — 

It) 4 


48 

(778)(1122) 


(616G)(14,680) 

48 


Average linear reH]>ense 


(2280 - 1975)(4505 5858) 

32 ■ ' 


== -9222 


"13,276 


The complete analyssis of covariance is shown in table 3.9, 


TABIT^ 3.9 SuM?^ of sqt: 

ARES AN'D PRODUCTS {x 
FUMiGATTOX) 

- BEFORE, 

y =5 AFTER, 


d.f. 


W 

iv^) 

Blocks 

3 

159,618 

175,873 

289,427 

Treafmcrifs 

8 

29,142 

-0,222 

157,448 

Average liru'ar resjionse 

1 

3,081 

-13,276 

57,207 

AvtT'age (nirvniiire 

1 

2,204 

8,285 

31,140 

DiftVrences in linear 

3 

22,975 

-6,837 

43,408 

Diffeivnees in. curvature 

3 

8S2 

2,606 

25,693 

Error 

36 

121,408 

189,278 

544,690 

4?ota.l 

47 

310,168 

355,929 

991,665 


If we (lenolo the suras of si|uares and products for error by Exxj Eyyj 
and Eyxj respc*cti\a)iy, the udjustinent factor c, or regression coefficient 
of jj on ;r, is given ht" 





■ 


189,278' 

12M08 


1.559024 


It may seem surprising that the coefficient is greater than 1. This is 
probably explained by the great seasonal increase in the eelworm num- 
^'bem, obvioiisTroni'taMe;3;.8.:' ^ 

'Fhe residual error s.s. may now be found. The original error s.s. is 
544,090, with 30 d.f. To remove the effect of the regression on the 
initial eeiwoiin numbers, \va subtract 

(189,278)^ 




yy 


■w 2 

' fi.XX': 


121,408 
844,690 - 


295,089 


295,089 = 249,601 


Thus the residual s,s. is 249,601, with 35 d.f., since 1 d.f. must be sub- 
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where r,- is the number of replications, and J’ denotes a treatment total. 
This gives 


m -h U 


T, 


w 




(3.25) 


The impox’taiil'j featiH’O of this equation is that in order to obtain the 
observed treatment mean of the y^B {Tijf/ri) is adjusted, the adjustment 
depending on the treatment mean of the x^s. It is this adjustment that 
removes the effect of the initial infestation. 

The equation for c, the adjustment factor, will not be developed in 
full. It turns out that e is given by the ratio of the error sum of products 
of y and x to the error s.s. of x. 

3,83 Computations. In practice we start with a Joint ana.tysis of the 
sums of squares and products of ^ and x. In order to illustrate certain 
features of the covariance technique, the treatments s,s. will be sub- 
divided into “linear” and “curvature” components, as was done for y 
in table 3.5 (p. 64). The original values for x are given above the y 
values in table 3.1 (p. 40). The treatment totals and the quantities 

TABLE 3.8 Treatment totaes fob numbers of cysts * 



Before fuDiigation (x) 

After fuinigation (i/) 

Level 

ON 

CS 

CM 

CK 

Total 

CN 

CS 

CM 

CK 

Total 

0 


1975 


1975 


5858 


6858 

1 

402 

417 

513 

570 

1902 

1066 

928 

1431 

892 

4317 

2 

: 389 

654 

56S 

778 

2289 

1265 

877 

1241 

1122 

4505 

2(2) +(1):L 

1180 

1525 

1649 

2126 

6480 

3696 

2682 

3913 

3136 

13327 

2(1) -(2): Q 

415 

280 

458 

362 

1515 


979 

1621 

662 

4129 


* Numbers of cysts per plot were the numbers found in 400 grams of soil. Thus 
the totals in the table represent 1600 grams of soil, except for the no-funiigant totals, 
which represent 6400 grams. 

needed for isolating the linear response and the curvature are reproduced 
in table 3.8. The analysis of the sum of squares of x follows by the 
same methods as given previously for y. 

To analyse the sum of products, we carry out the same operations as 
for a sum of squares, except that at every stage a square is replaced by 
the corresponding product. A few examples will suffice. 
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COMPUTATIONS 


77 


Totu!: (2ti'.))!4W;t) + (2S3)(2S0) +■■■-{■ (48)(298) - 


(6166)(14.680) 


^ (1975)(5858) , (402)(1066) , 

Tri'atint-'iits: — — i ; h 1- 

It) 4 


48 

(778)(U22) 


- 355,929 


4 


(6166)(14,680) 

48 


Averag^.^ linear response: 


(22S9 - 1975)(4505 -- 5858) 

■ ■ '32 ■ 


-9222 

-13,276 


The conirilete analysiB of eovariaiice is shown in table 3.9. 


TABLE 3.9 Suais of sqitares and products (jc 

fumigation) 

di. 0.^ 

= BEVOKB, 

(yx) 

II 

Blocks 

3 159,618 

175,873 

289,427 

Treat nunits 

00 

to 

-9,222 

157,448 

Av<.‘vage linear response 

i 3,081 

-13,276 

57,207 

A.\T‘n.ig(3 curvature 

1 2,204 

8,285 

31,140 

Dli’ft'rcnees in linear 

3 22,975 

-6,837 

43,408 

DitTercju'cs in cvirvature 

' 3 ■ ■ 882 , 

2,606 

25,693 

.Error 

36 121,^108 

189,278 

644,690 

Total 

47 310,168 

355,929 

991,565 

If wo denote t.i'ie Finns 

<,>f squares and products for error by E^ 


and rc‘B|HH5tive!y, the adjiisLruent factor a, or regression coefficient 
:of f on given l.>y' , 




189,278 


J?** 121,408 


1.559024 


It iTiay seem sm’prising that the coefficient is greater than 1. This is 
in'obably o.xplaiued Iw the great seasonal increase in the eelworm num- 
bers, obvious from table 3.8. 

The residual error s.s. may now be found. The original error s.s. is 
544,090, with 30 d.f. 'Co remove the effect of the regression on the 
initial eelworm numbers, we subtract 


(189,278)2 




295,089 


■'vv 




121,408 
544,690 - 295,089 = 249,601 


Thus the residual s.s. is 249,601, with 35 d.f., since 1 d.{. must be sub- 
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tracted for the additional parameter 7. The residual m.s. is 
249,601/35, or 713L This is less than half the original mean square 
of 15,130, and indicates that the use of covariance has approximately 
doubled the accuracy of the experiment. 

One feature of the covariance method deserves comment at this point. 
In. the model the effect of .r was assumed to be linear, but no assumption 
was made about the stengtli of the effect. That is left to be determined 
by the data. If the eelworm numbers on each plot had remained un- 
changed throughout the season, except for the influence of the treat- 
ments, and if the eelwonn nunibers bad been accurately measured, the 
residual s.>s. \vould have been zero. The residual s.s. as found presumably 
represents a contribution due to seasonal variations in infestation and 
one due to the fact that the eelworm numbers were not estimated ac- 
curately, being obtained oni^^ from small samples of soil. If the variable 
X had had linear effect on the residual m.s. would have been the 
same as the original mean square, apart from sampling fluctuations. 

3,84 The Adjusted Treatment Means. A little time is saved if iiie 
adjusted treatment totals are found finst. From equation (3.24) the 
adjustment to the fth treatment total of y is 

c(/ 

where is the number of replicates, and x the general mean of x (128.46). 
Thus for the control the adjusted total is 

5858 - .L559()(i975 - 16 X 128.46) - 5858 + 125 = 5983 
and for {WN) 

10(>6 -- 1.5590(402 - 4 X 128.46) - 1066 +• 174 - 1240 
The ?nmns are shown in table 3.10, 

TABLE 3.10 AdJCSTED TEEATAmNT MEANS (b,elworms per 400 GM. som) 


Fumigant 

Level, oi r— — — ■ — ■ — 


dretssing 

CN 

CS 

CM 

CK 

0 

1 

310 

374 

270 358 

201 

2 

365 

204 

280 

178 


3.86 t-tests. These tests are slightly complicated by the fact that 
account must be taken of the sampling error of the adjustment factor c. 
The difference between two adjusted means may be ■written 

•> Vi- y-i - c(»r — X2) 
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From regression theory the variance of this quantity is given by 

+ i^x - 


«.■/ (- + -) 




1 ^ {^1 — 

^ 

n ra E,, 


(3.26) 


where is the residual error variance, and E^x the error s.s. for x. As 
an estimate of o-,/./ we use the residual error m.s., in this case 7131 with 
35 d.f. 

Example. .4 t-test of the reduetion to the double dressing of 08. The re- 
duction is (374 — 204), or 170 eelworms per sample. The e,stimated 
variance is 


(7131) 


1 1 (123.44 - 138.50)2' 

_ -1- _ -I- 4 ___i. 

Ll6 4 121,408 


(7131) (0.31437) = 2242 

(3.27) 


the means of the a:’s being found by division from table 3.8. Hence 


170 170 

t = --=z = ' = 3.59 

V2242 47.35 

with 35 d.f., which is highly significant. 

For a moi’e complicated comparison of the adjusted means, 

'E/ iMi - cfe - «)] 

the estimated variance is 



7.2 

E- + 


(Em 



(3.28) 


(The term in x vanishes because for any comparison the .sum of the 
must be zero.) 

One annoying feature of these tests is that the x values enter into the 
variance, so that every comparison necessitates a separate computation 
of the variance. As a time-saving approximation, Finney (3,10) has 
})j‘oposed that an average value for the contribution of the term in the 
.r^s may be used. This amounts to using 




1 -f- 


Exxi 


(7131) 


1 + 


3643 

121,408J 


(7131)(1.030) = 7345 


as the effective residual error m.s., where 4* is the treatments mean square 
for X. The term in brackets represents the average contribution from 
the x’s, or in other words the contribution from sampling errors in the 
adjustment factor c. 


.OTFS ON THE STATISTICAL ANALYSIS OF THE RESULT, 
llius, 101 wi ^ ^^.Quld use 

f the control and 20^,110' 

V* M .ivon by tb. mo...ex»t ex„re«o„ (3.27). 

;£„(y, (.*=.) ol (3.28), 

,Z^Z(£) 

11 ,r .nd enou'di if the number of error de- 
This approximation is , J, eases the contribution from 

grces of freedom exceed. O s .^. 

errors in c IS small, iliemoi 

. Tl 1 fact tint the same adjustment factor appears in 
3.86 F-tests. Thr, fact t , ^ p.-needure rs shown 

every treatment mean also mtluenu.s . 

in table 3.11. 

table 3.11. i-VmsT wrrH mn ANAmms of covAui.mcn 

I Kesidiials 


’i:'reatnients 

V^Tor 


1 rw left of the vertical line are from the previous analysis 

“ “SiflO.” ^2,138, »bt.- «» 

ressiononr, /.onorm-a 

= 21.1,3-15 

1.50,550 

ai ■ /In.. ARfi 703 ia entered in the column headed h sidutds s...‘. 

the 486,793, Tte x..mc 

l3b74“-« '» 

: . : ; . .. .7 .ho8! ICO - AfiK7Q.3 — 240.601 


ail 

ilVcEST 

WI'l’H THE 

ANAIaTSIS of c 

■1 f 

(*^) 


(;r) 'i.f. 

.i*i* 

8 

36 

29,142 

121,408 

-9,222 

189,278 

1.57,148 1 8 

511,090 1 35 

44 

150,550 

180,0.50 

702,138 13 




B.87 THE INCBEASE IN ACCURACY DUE TO COVARIANCE SI 


It always lias the same number of degrees of freedom as the original 
treatments s.s; The F-test of the adjusted treatment means is given by 
the ratio of the residual m.s. 


29,649 

"nsF 



with 8 and 35 d.f. 

If it is desired to test some component of the treatments s.s,, the same 
calculation is made with the component in place of the treatments s.s. 
throughout. If several components are to be tested, this becomes rather 
tedious. A useful approximation is to construct from the original anal- 
ysis of covariance ill table 3.9 an analysis of {y — ar)^. This is most 
easily done by multiplying each term in (;r“) by or 2.43048, each term 
in (y.r) by —2c, or —3.11805, and adding the two products to {y^)\ The 
results are shomi in table 3.12. 

. ■ k ' ' 


TABLE 3. 12 Analysis 
d.f. 


Treatments S 

Average linear re.spnnse 1 

Average ciirvature 1 

Differences in linear 3 

Diherences in curvature 3 

Error 35 


r VABIANCIil Of {y — cx) 


s.s. 

m.s. 

¥ 

F'' 

257,032 

32,129 



106,000 

106,090 

14.88 

14,51 

10,664 

10,664 

1.50 

1.47 

120,506 

40,189 

5.64 

5.05 

10,711 

6,570 

0.92 

0.92 

249,592 

7,131 




It will be noted that this calculation gives the correct residual error 
m.s. However, sums of squares for components of the treatments are ' 
always larger tiian those given by the more roundabout correct pro- 
cedure, Thus the F values shown on the right are all too large. The 
overestimation is seldom great, and the F values serve for a preliminary 
inspection. Those that are just beYond the significance level may be 
recomputed by the correct procedure if it is thought worth while. For 
comparison, the F' values shown above are those obtained by the 
correct procedure (it is a useful exercise to check them). With either 
F or F^ there is a significant average linear response and significant 
differences among fumigants in their linear responses, while the curva- 
ture terms do not approach ^sigmficanGe. In the analysis made without 
covariance (table 3.5, p. 64) no component was significant. 

3,87 The Increase in Accuracy Due to Covariance. From table 3.9 
we see that with no covariance the error s.s. for y is 544,690, with 36 
di., giving an error m.s. of 15,130. A comparable figure when covariance 
is used is the effective residual m..s., 7345. This is preferable to the 


82 NOTES ON THE STATISTICAL ANALYSIS OF THE RESULTS 3.87 

residual m.s, itself (7131) because it makes allowance for the sampling 
error of the adjustment factor c. The accuracy obtained with covariance 
relative to that without covariance is estimated by 15,130/7345, or 2.06. 
The use of covariance appears to have had about the same effect as 
doubling the number of replicates. In making this comparison, we ig- 
nored the effect of the reduction in error d.f. from 36 to 35, because it is 
negligible. 

As in ordinary regressions, the x variable may be transformed to an- 
other scale if this is likely to produce a more linear relation with ?/• The 
use of log for instance, is common in biological work. Two or more 
different x variables may be used. The calculations for this case are 
described by Snedecor (3.4, chapter 13). 

3.88 The Case Where the x Variable is Influenced by the Treatments. 
In a covariance analysis the treatment mean iji is adjusted by the amount 
—c(xi — x). The effect of the adjustment is to change each to the 
value that it would be expected to have if all treatments had the same 
X mean. It is in this way that the technique removes the effect of 
variations in the Xi, If, however, the differences among the Xi are in part 
produced by the treatments, the adjustment removes part of the treat- 
ment effect and its interpretation is changed. Consequently, in the 
standard use of covariance, it is important to be sme that the treatments 
did not affect the x values. Sometimes this is obvious, as in the eel- 
worm experiment, because the x values were recorded before the treat- 
ments were applied. A more doubtful case is that of the number of 
. plants per plot in a field ex|)eriment, counted after the application of 
the treatments, which may or may not influence plant numbers. An F- 
test of the x values is helpful in such cases. 

Where the treatments do affect a covariance analysis may add in- 
formation about the way in which the treatments produced their effects. 
In the eedworm experiment, the yields of the oats which were grown on 
the plots were obtained. Since eelworms attack oats, it would be in- 
teresting to kno^^r whether the effects of the treatments on the oats were 
simply a reflection of their effects on the eelworms. This is examined 
by a covariance analysis in which the oats yields are the y values, and 
the eelworm numbers after harvest are the x values. If the F-test of the 
adjusted treatment means still shows significance, the conclusion is that 
not all the treatment effect on the oats can be attributed to the reduction 
in eelworm numbers. This happened in the present instance, because 
some treatments supplied nitrogen to the crop, and therefore acted in 
part as fertilixsers as well as fumigants. As Bartlett (3.11) has pointed 
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out, the interpretation of this use of Govariance requires care, since a 
hidden extrapolation may be involved. 

When the x variables are influenced by treatments, the short-cut 
method for ^-tests by use of the effective residual error and the approxi- 
mate F-tests by means of an analysis of {y — c.r) should be avoided, 
since they may be seriously in error. 

3.9 Effects of Errors in the Assumptions Underl 3 dng the Analysis of 

Variance 

The assumptions made in the analysis of variance are that treatment 
and environmental effects are additive, and that the experimental errors 
are independently distributed in the normal distribution, with a common 
variance. In practice we can never be sure that these assumptions all 
hold, and often there is good reason to suspect that some are false. The 
consequences of failures in the assumptions and the remedial steps to 
be taken have been summarized recently by Eisenhart (3.12), Cochran 
(3.13), and Bartlett (3.14). Only a few comments will be given here, 

As a rule, the failure of an assumption will affect both the significance 
levels and the sensitivity of F- and ^-tests. Vflien the experimenter 
thinks that he is testing at the 5% level, he may actually be testing at 
the 8% level. Usually, though not invariably, the true significance 
probability is larger than the apparent one; that is, too many significant 
results are obtained. Also, there is usually a loss of sensitivity?’, in the 
sense that a more powerful test than the analysis of variance F-test 
could be constructed if the correct mathematical model were loiovm. 
There is a corresponding loss of accuracy in the estimates obtained for 
the treatment effects, since these, too, could be made more accurate if the 
correct model were known. 

Although generalization is hazardous, experience suggests that in the 
majority of experiments, at least in the field of biology, these disturb- 
ances are not sufficiently great to invalidate the technique. They do 
imply, however, that significance levels and confidence limits must be 
considered approximate rather than exact. For the same reason, the 
inflexible use of say the 5% significance level to divide the effects into 
those that are regarded as and those that are not is hardly jus- 

tifiable. 

The most serious disturbances appear to arise when the experimental 
error variance is not constant over all observations. Sometimes this 
happens, as mentioned previously, because certain treatments are erratic 
in their effects. In such cases, the appropriate error variance for com- 
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paring one pair of treatments might be four times as large as that for 
another pair, and the use of the same estimated variance for both com- 
parisons would lead to t-tests that were completely erroneous. lYhere 
this type of disturbance is suspected, the remedy is to divide the error 
S.S. into components ea(jh of which is homogeneous, as indicated in 
section 3.()3. 

The Barne problem niay arise because the expen^^^ errors follow a 
distribution that is decidedly skew. In such distributions the error 
variance for a treatment tends to be a function of the mean produced by 
the treatment. If the nature of the functional relationship is known, a 
transft.n’mation can be found that will place the data on a scale on which 
the error variance is more nearly constant. This transformation is then 
made on the observations before starting the analysis. The principal 
transformations that have been found useful are discussed by Bartlett 
(;hi4); they inthide logs, square roots, and (for data expressed in frac- 
tions) inverse sines. ■ ' 

Such transformations may also be useful in cases where treatment and 
environmental effects ai’e not additive. If, for instance, a treatment in- 
creases all observations by 20%, irrespective of the initial level, a change 
to logs will introduce additivity. Wlien transformations are made for 
this purpose, it should be realized that they will also affect the distribu- 
tion of the experimental errors. Fortunately, it often happens that such 
transformations also bring the distribution of errors closer to normality* 

Finally, the assumption that the errors are independent from observa- 
tion to observation may be obviously untenable. It is well known that 
crop ^yields on neighboring plots tend to be positively correlated, and in 
laboratory experiments observations made by the same person at about 
the same time tend tf> exhibit tlie same type of correlation. These cor- 
relations might completely vitiate tests of significance. The remedy in 
this case is the proper use of randomization, which, as it were, introduces 
independence in the assignment of treatments to the experimental units 
or ill the a^ssignment of the order in which observations are made, so that 
the errors may effectively be regarded as independent. For further dis- 
cussion of this question, see Yates (3.15), Fisher (3.3), Bartlett (3.16), 
and Welch (3.17). 
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CHAPTER 4 

COMPLETELY EANDOMIZE^ RANDOMIZED BLOCK, 
AND LATIN SQUARE DESIGNS 


44 Completely Randomized Designs 

4,11 Description. The simplest ty pe of layout is that in which treat- 
mcnts are allotted to the u nits entirely by chanc e. More specificaliyj if 
a treatment isLd be applied to four units, for example, the randomization 
gives every group of four units in the experimental material an equal 
probability of receiving the treatment. In addition the units should be 
processed in random order at all subsequent stages of the experiment 
where this order is likely to affect the results. 

This design has several conveniences: 

1. Compile Any number of treatments and 

of replicates may be used. The number of replications can be varied at 
will from treatment to treatment (though such variation is not recom- 
mended without good reason). ^111 t he available exp erimental ma terial 
canJ)ejitili£:ed — an advantage in small preliminary experiments where 
the supply of J.a.scarce, 

2. The stat istical analysis is eas y even if the numbers of replicates are 
not the same for all treatments or if the experimental errors differ from 
treatment to treatment. 

3. The method of analysis remains simple when the results from some 

units or from wlfole^tmalme^ are rejected. Moreover, 

the relative Icjss of information due to missing data is smaller than Avith 
any other design. 

The principal objection to a completely randomized design is on the 
grounds of actcuracy. Since the randomization is not restricted in any 
way to ensure that the units which receive one treatment are similar to 
those which receive another treatment, the whole of the variation among 
the units enters into the experimental error. For this reason the error 
, can often be reduced by the use of a different design, unless the units are 
highly homogeneous or the experimenter has no information by which 
to arrange or handle the imits in more homogeneous groups. 

Complete randomization seems the obvious procedure for many lab- 
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oratory experiments, e.g., in physics, chemistry, or cookery, where a 
quantity of material, after thorough mixing, is divided into small sam- 
ples or batches to which the treatments are applied. On the other hand, 
these designs are seldom used in field experiments, the method of ran- 
domized blocks having been found consistently more accurate. 

One fact compensates to some extent for the higher experimental er- 
rors as compared with other designs. For a given number of treatments 
and a given number of experimental units, complete randomization pro- 
vides the maximum number of degrees of freedom for the estimation of 
error. As pointed out in section 2.31, the sensitivity of the experiment 
increases as the number of error degrees of freedom is increased. Con- 
sider, for example, an experiment with 2 treatments in 4 replicates. The 
degrees of freedom for error are 6 under complete randomization as 
against 3 if the method of pairing (section 4.2) is used. From section 
2.31 it appears that the paired experiment must produce about a 14% 
reduction in the error variance in order to offset the additional iin- 
reliability in the estimate of error. This point is worth bearing in mind 
with small experiments. 

To summarize, complete randomization may be appropriate (i) where 
the experimental material is homogeneous, (ii) where an appreciable 
fraction of the imits is likely to be destroyed or to fail to respond, and 
(iii) in small experiments where the increased accuracy from alternative 
designs does not outweigh the loss of error degrees of freedom. 

4.12 Randomization. If the number of units does not exceed 16,, 
tables 15.(5 and 15.7 may be used. Suppose that there are 3 treatments, 
of which two have 4 replicates while the third has 8 replicates. Num- 
bers 1-16 are assigned to the units in any convenient order. A random 
permutation is drawn from table 15.7, say 

9, 13, 8, 5, 12, 1, 14, 16, 6, 7, 3, 4, 10, 11, 15, 2 

The first treatment is applied to imits 9, 13, 8, 5; the second to units 12, 
1, 14, 16; and the third to the remainder. 

With more than 16 units in the experiment, numbered discs, beans, 
or a book of random numbers may be used, as described in chapter 15. 

4.13 Statistical Analysis. Table 4.1 shows some of the results of an 
experiment on the effects of applications of sulphur in reducing scab dis- 
ease of potatoes. The object in applying sulphur is to increase the acidity ^ 
of the soil, since scab does not thrive in very acid soil In addition to 
untreated plots which serve as a control, 3 amounts of dressing were 
compared — 300, 600, and 1200 lb. per acre. Both a fall and a spring ap- 
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plic^aiion uf eiu'h amount was tested, so that in all there were 7 distinct 
treatments. The sulphur was spread by hand on the sui’face of the soil, 
and then disced in to a depth of about 4 inches. The quantity to be 

TABLE 4.1 Fiislb pnAN AiNtd scab indices for a completely randomized 

EXPERIMENT ON POTATOES 


F3 

9 

0 

12 

so 

18 

F12 

10 

SO 

24 

S12 

17 

S3 

30 

F6 

16 

0 

S3. 

F12 

F6 

S3 

0 

0 

S6 

10 

■ 7, 

4 

10 

21 

24 

29 

12 

F3 

S12' 

F6 

0 

F6 

S12 

F3 

F12 

9 

7 

18 

30 

18 

16 

16 

^ ■ 4 

S3 

0 

' S12 ' 

SO 

0 

F12 

0 

CO 

9 

IS 

17 

19 

32 

5 

26 

4 


Notation: F - fall/ S = spring application, 0 — control. The numbers 3, 6 , 12 
are the amounts of sulphur in 100 lb. per acre. 

Hesults grouped by treatments 


Totals 

M<.‘ans 


0 

P3 

S3 

F6 

S6 

FI 2 

S12. 

12 30 

9 

30 

16 

18 

10 

17 

10 18 

9 

7 

10 

24 

. 4 

' 7 . 

?4 32 

16 

21 

18 

12 

4 

16 

'29 26 

4 

9 

18 

19 

5 

17 

181 

3S 

67 

■ ■ ■,62 , ■ " 

73 

23 

57 G - 601 

22.0 

9.5 

16.8 

15.5 

18.2 

5.8 

14.2 


Source of variation 
Treatmenls {t - 

Error (N 


Analysis of variance 

d.f. s.s. 

■ 1) - 6 972.3 

-0 « 25 1122.9 


m.s. 

162.0 

44.9 


F 

3.61^ 


Total (iY - 1) - 31 2095.2 

* Denotes significance a.t the 5% level 

analyzed is the “scab index.” This is, roughly speaking, the percentage 
of the surface area of the potato that is infected with scab. It is obtained 
by examining 100 potatoes at random from each plot, grading each po- 
tato on a scale from 0 to 100% infected, and taking the average. 
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The design was a completely randomized one, with 4 replications of 
each sulphur dressing and 8 replications of the control, which received 
extra replication in order to obtain a fairly good estimate of the natural 
infestation. Incidentally, a randomized blocks layout might have been 
superior. 

The computations are very simple. Let denote an observation, 2\: a 
treatment total, 6 the grand total, the number of i*eplicationB of the 
?th treatment, iV — ^ the total niunber of observations, and 2 the 
number of treatments. 

Step 1 . Find the treatment totals and the grand total. 

Step 2. The sums of squares are computed as follows. 

( 501)2 

Correction factor: (7 ~ -r = — r— = 7843.8 

N 32 


Total: 'Ey- -C = (9)^ + (12)2 ^ ( 4)2 _ 7g43_8 


fp.2 

Treatments: — — C 

n 


( 181)2 , ( 38)2 + ( 67)2 + ■..+ ( 67)2 

n 


8 4 

Error: (Total s.s.) — (treatments s.s.) = 2096.2 — 972.3 


= 2095.2 
7843.8 = 972.3 
= 1122.9 


The analysis of variance is given at the foot of table 4.1. The F- 
ratio for treatments is significant at the 5% level. From the treatment 
means it appears that all dressings had some beneficial effect, and that 
the fall application was more effective than the spring one. There is 
little or no evidence that the higher dressings were more effective than 
the lowest dressing. If the summary is conducted from this point of 
view, we should isolate and test two individual components of the treat- 
ments s.s. : (i) a component measuring the average effect of all dressings, 
(ii) a component comparing the fall and spring applications. The follow- 
ing computations are required. 

Average effect of sulphur. The total over all dressings is 320, repre- 
senting 24 plots. Since the control total, 181, represents 8 plots, the 
compaiison is 

3(181) --- 320 = 223 


The contribution to the sum of squares in the analysis of variance is 
(223)V96, or 518.0. By the rules in section 3.42, the divisor, 96, is 

Fall versus spring application. The comparison is 


(38 + 62 + 23 ~ 67 - 73 -- 57) = -74 

It is obviously orthogonal to the previous comparison. The contribu- 
tion to the sums of squares is (74) or 228.2. These calculations 
lead to the analysis of variance in table 4.2. 
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llie remaining foiu* components of the treatments s.s. must represent 
comparisons among the levels of sulphur. The average reduction in scab 
due to sulphur is significant at the 1% levels while the superiority of the 
fall application is also significant. Differences among the levels show no 
sign of significance, 

TABl.E 4.2 ScBDivisiox of the treatments sum of squares in the 

EXPERIMENT ON POTATO SCAB 


Bourct^ of variation 

d.f. 

s.s. 

m.s. 

F 

Treatmr;‘nts 

() 

972.3 

162.0 

3.61 

Control vR. siilpliur 

1 

518.0 

518,0 

11.54 

Fall VR. Rpring application 

1 

228.2 

228.2 

5.08 

CcmiparisonR among levols 

4 

226.1 

66.5 

1,26 

Error 

25 

H22.9 

44.9 



* Denotes signififjance at the 5% level. 
Denotes significance at the 1% level. 


The conclusions might be phrased as follows. “The application of 
sulphur produced a significant decrease in the scab index, the averages 
being 22.6 for the untreated plots, 16.4 for plots with the spring applica- 
tion, and 10-2 for plots with the fall application. The fall application 
proved significantly better than the spring application. There was no 
indication that the higher levels of dressing were more effective than the 
lowest level.’’ It should be remarked that a continuation of this experi- 
ment and other experiments caused these conclusions to be modified, 
the higher levels having shown to better effect. 

A more extensive discussion of the analysis of experiments of this type 
is given by Snedecor (4.1). Note that, if certain observations are missing 
or have to be rejected, the method of computation remains exactly the 
same except for the slight complication that different treatments will 
usually have different numbers of replications. 

The practice of calculating a separate standard error for each treat- 
ment is still found in some types of experimentation. Unless the treat- 
ments have different error variances, the use of a pooled error, as given 
by the analysis of variance, is recommended since more sensitive tests of 
significance are obtained. In cases where the experimental error appears 
to vary considerably from treatment to treatment, a test of homogeneity 
of the error variance, due to Bartlett (4.2), may be made. This test is 
also described in Snedecor (4.1, section 10.13). 

It is worth emphasizing that, even if the treatments are randomly as- 
signed to the units, the above methods of analysis do not apply when 
restrictions on the randomization are introduced at a later stage in the 
experiment. Consider a chemical experiment in which samples of liquid 
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from the same bottle are treated in four different ways. The samples, 
being mutually indistinguishable, are assigned at random to the treat- 
ments. It is, however, practicable to treat only a few samples during 
each work period, and the investigator decides to complete a single 
replication at each session. The result of this additional control is that 
the differences among replicates are eliminated from the experimental 
errors and must equally be eliminated from the estimated errors. The 
analysis should follow the method described in section 4,2. This point 
will arise frequently in experiments where many physical operations 
must be carried out. 

4.14 Standard Errors. The estimated standard error of the difference 
between two treatment means is 




This formula applies when a pooled error is used, being the pooled 
error m.s. per unit, and ti and r 2 the numbers of replicates for the two 

treatments. If fi = r 2 = r, the formula reduces to h — • The degrees 


of freedom for t-tests are those in 

Example, The average effect of the sulphur has already been tested 
by an F-test. We will perform the same test by means of a ^-test applied 
to the treatment means. Since the control has 8 replications, and the 
mean of all sulphur dressings has 24 replications, 

Sd = V(44.9)(|- + *) = 2.736 


The mean scab index for the control is 22.62, and that for the dressings 
is 13.33. Hence 


22.62 - 13.33 

'~2Jm 


3.395 


with 25 d.f. The value of F is 11.53, in agreement with the F found in 
the analysis of variance. 

If the true error variances per unit are considered to be different for 
the tw^o treatments, the appropriate formula is 






where are the respective error m.s. per unit for the tAvo treatments. 


«, COMPI.HrrELY lANDOMBED DEBiaNS *■» 

nf ihp treatment difference to Sd does not follow 
In this ease the ratio of t ^ ^ i iv^sfonces The development of 

taljS‘vra«a V2) give b.l.les of 

from Fislmr’s iheory of fiducial probability. (Note that then Si coi 
Horn 1 ihliu u y 

responds to our si /n, etc.; vvucio mi, „,.n1->e,blv errs 

cessible the following approximation is suggested. It p ■ J ■ 
Hlightly’on the conservative side, in the sense that the value of t requued 

for significance may be slightly too lugli. _ ^ ,.f,cnpctivplv 

L('t 7 i\ n 2 be the numbers of degrees of freedom in si , 2 > P ' ^ y 

From the ordinary i-table record the significance levels k, h correspond- 
ing to ni and % degrees of freedom, respectively. The approximate sig- 
nificance level for the ratio d/s^ is 

W-di 

t' ^ 

Wi + W2 


Since t' always lies between the ordinary t values for ni and ?i 2 degrees 
of freedom, this calculation is needed only for those occasional cases 
where d is close to the borderline of significance. Further, when ni - 
no = n, t' is the ordinary t value for n degrees of freedom. 

"" Kxamiile. This experiment is a case where one might suspect that the 
error variamte would not be homogeneous. If a treatment is very suc- 
(:e.ssful in reducing scab, the scab indices for that treatment will all be 
close to zero; hence their variance must be small. With, a treatment 
sucii as the control the scab index has a greater possible range of vari- 
ation. Consequently, we might expect the error variance for a treatment 
to depend on the mean produced by the treatment. With only 3 d.f. 
for the estimated error variance per treatment, we cannot hope to detect 
small differences in the true error variances. Perhaps the most important 
comparison is that between the variance for the control and the average 
variance for the sulphur dressings. The error m.s. will be found to be 
70.0 for the control (with 7 d.f.) and 35.2 for the dressings (with 6X3 
or 18 d.f.). The F-ratio, 1.99, falls just short of the 10% level of sig- 

Consequently, the conclusions wiE be on a sounder basis if we ascribe 
different true error variances to the control and the dressings. For a 
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^-test of the average effect of sulphur we now take 

, /to.O 35 ^ ■■ /— — 

Sd - + = V 8.75 + 1.47 = 3.197 

Ms 24 

The value of t is 9.29/3.197, or 2.906. To test this at the 1% level by the 
approximate test above we have 

m - 7; n2 == 18; h = 3.499; = 2.878; wi = 8.75; wg = 1.47 

Hence the value required for significance is 

f (^-75) (3.499) + (1.4^^^ ^ ^ 

10.22 ' ^ ’ 

The average effect of sulphur now falls short of the 1% significance level. 
It remains significant at the 5% level. For comparisons among the dif- 
ferent sulphur dressings we would use ordinary i?-tests, where is taken 
as 35-2 with 18 d.f. The effect will be to enhance tlie significance of 
comparisons among the dressings, though the conclusions quoted pre- 
viously are not altered. 

As mentioned in section 3.9, it is usually better to handle this type of 
experiment, where the error variance is thought to be a function of the 
mean, hy converting the original data to a scale on which the error vari- 

TABLE 4.3 x^nalybiw of variance of the square roots of tite scab indices 


Source of variation 

d.f. 

s.s. 

m.s. 

F 

Treatiiieiits 

6 

18.22 

3.04 

4.19 

Control vs. sulphur 

1 

8.17 

8.17 

11.27 

Fall vs. spring apjilication 

1 

4.59 

4.59 

6.33 

Comparisons among levels 

4 

5.46 

1.36 

1.88 

Error 

25 

18.12 

0.725 


Error for controls 

7 

6.27 

0.896 


Error for dressings 

18 

11.85 

0.658 


Total 

31 

36.34 




ances will be more nearly constant. On the supposition that the error 
variance may be proportional to the mean, the square roots of the scab 
indices might have been analyzed. As a matter of interest this analysis 
is shown in table 4.3; square roots were recorded to only one decimal 
place. 

The error m.s. for the controls is now closer to that for the dressings, 
the ^'-ratio being 1.36 against 1.99 in the original analysis. Further, the 
F-ratios for most components of the treatments s.s, are higher than in the 
original analysis, suggesting an increase in the sensitivity of the analysis* 
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4.2 Single Grouping : Randomized Blocks 

4.21 Description. The essence of this design is that the e^^hnental 
" ' m aterial is dhdded into group s, each of which constitutes a singlejrial 
orj:^lication. At all stages of the experiment the object is to keep the 
experimental errors within each group as small as is practicable. ThuSj 
wlien the units are assigned to the successive groups, all units whic h go 
in tjie saim^-oiip sh.oiiId jjg closely comparable. Similarly, during the 
course of tJie experixnent, a uniform technique should be employed for 
all units in the same group. Any changes in technique or in other con- 
ditions that may affect the results should be made between groups. , 
This division into replications need be recognized only at those stages 
in the conduct of the experiment where the division may help to reduce 
experimental errors. In agricultural field experiments the division is 
made at the start -when the plots arc marked out in the field. Since 
neighbo 3 *ing plots are known to be more alike in fertility than plots some 
distance removed, each replicate consists of a compact group of plots 
and is made approximately square in shape if feasible. Cultivations 
designed to keep the land clean of weeds will usually be carried out with- 
out regard to the replications, because it is not believed that results are 
affected by the order in w^hich plots are cultivated. Similarly, the plots 
will generally be harvested in w^hatever order is most convenient. If, 
ho^vever, harvesting must be spread over a number of da^^s, it is w^ell to 
harvest the plots replication by replication, in case rainfall or other 
factors should produqe changes in the weight of the crop from day to 
day. In other types of experimentation no real distinction might be 
made between replicateslmtil relatively late in the experiment. 

The principal advantages of randomized blocks are as follow^s. 

L By means of the grouping, more accurate resu lts are usually ob- 
tained than with completely randomized designs. 

2. Any number of treatments and any numbe r of replicates may be 

i ncluded . """^trthe design asdesmibed abover^cE'Ti^^ have 

the same number of replicates. If extra replication is desired for some 
treatments, each, of these may be applied to tw^o units within every 
group. This device provides twice the standard number of replicates 
for the treatments in question, at the expense of some increase in the size 
of the group, which now contains more than a single replication. Sim- 
ilarly a treatment may be ijppiied three or four times in a group. 

3. The statisti cal analysis is strai ghtoward. Mishaps which neces- 
sitate the omission of a complete group or of tlie entire data from one or 
more treatments do not introduce any complication in the analysis. 
When data from some individual units are lacking, the '‘missing-plot'' 
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technique developed by Yates enables the available results to be fiilly 
utilized. Some extra computational labor is, however, involved, and if 
the gaps are numerous the design is less convenient in this respect than 
complete randomization. 

4. If the experime nta l error varia nce isJ ^^^jageiLlar t r eatments 
than for others, an unbiased error for testing any sp ecific combi nation 
of the treatment means can ^ilt be obtaine dr” 

No design is more frequently used than randomized blocks. Certainly 
if a satisfactory degree of precision is reached, there is little need to search 
for alternative designs. 

It is worth noting that the replication means provide unbiased com- 
parisons of the differences among replicates. Occasionally, these dif- 
ferences measure some property of the experimental material that is of 
interest. The variance-ratio test of the replications against the error 
m.s. requires some care in its interpretation. In a greenhouse experi- 
ment, for instance, a different type of soil might be used in each replica- 
tion. Significant differences among replications might be due either to 
differences in soil type or to differences in the positions of the replications 
within the greenhouse. 

4.22 Randomization. When the units have been grouped, the trea t- 
me nts are assigned at random to the un it s with in eac h group . A new 
randomizaticm is made f or every group . Unless the number of treat- 
ments exceeds 16, tables 15.6 and 15.7 are convenient for this operation. 

4.23 Statistical Analysis. The example comes from an experiment 
carried out by the North Carolma Agricultural Experiment Station at 
Rocky Mount, N. C., in 1944. The experiment tested the effects of 6 
levels of application of potash, supplying respectively 36, 54, 72, 108, 
and 144 lb, K 2 O per acre, on the yield and properties of cotton. The 
measure chosen for analysis is the Pressley strength index. This is 
found by measuring the breaking strength of a bundle of fibers of a given 
cross-sectional area. A single sample of cotton was taken from each plot, 
and 4 determinations were made on each sample. The figures in table 
4.4 are the means of these 4 samples.* The experiment was arranged 
in 3 randomized blocks of 5 plots each. 

To make the computing instructions more general we suppose that 
there are t treatments and r replicates, and that y denotes a typical ob- 
servation. 

* Since the machine which measures the index is calibrated in arbitrary units, no 
dimensions are ascribed to the data in table 4.4. The index can be converted ap- 
proximately into pounds per square inch by means of a regression formula. 
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Step 1. Find the treatment totals (?\’), the replicate totals 
the grand total (G). 

TABLE 4.4 Steength index of cotton in a eandomized blocks experiment 

Replications 

„ — — --n Coded 

1 2 B Totals dressing 

7.62 8.00 7.93 23.55 13 

8.14 8.15 7.87 24.16. 8 

7.76 7.73 7.74 23.23 3 

7.17 7,57 7.80 22.54 -7 

7.46 7.08 7.21 22.35 -17 


Trc^ai-meiits 
Pounds K 2 O per acTo 


Totals 


Analysis of variance 


Source of variation 
Replications 
Treatments 
Error 


Total 


Subdivision of the treatments s.s. 


Linear response 
Deviations 


Step The sums of s^piares are obtained as follows. 


894.4393 


Correction factor: C 


Replications: 23 
Ti’catments: 

ot.2 


Error: (total s.s.) — (replications s.s.) — (treatments s.s.) 

« L1790 - 0.0971 - 0.7324 0.3495' 

The F-ratio for treatments (table 4.4) is 0.1831/0.0437, or 4.19, which 
is significant at the 6% level with 4 and 8 d.f. The treatment totals 
suggest that the strength decreases with increasing applications of 
potash, though there is a hint of a maximum in strength for the 54-lb. 
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application. Accordingly it is worth while to examine the shape of the 
response curve. We will first fit a linear regression on the amount of 
dressing. Note that successive increments of dressing are not equal, the 
last two increments being twice as large as the first two. This is a 
fairly common practice in cases where the effectiveness of an increase in 
dressing is expected to be smaller at the higher levels of application. 

Although the regression can be calculated by the standard formula, 
a simple coding of the dressings lightens the work. If we place the 
low’^est dressing (36 lb.) at zero and take 18 lb. as 1 unit on the scale, the 
dressings may be coded as 0, 1, 2, 4, and 6, respectively. A further de- 
vice is to subtract a common amount from these values so that their 
mean is zero. Since the mean of the coded dressings is this is the 
amount that must be subtracted. To avoid fractions, we multiply all 
values by 5 and subtract 13 from each product. The resulting coded 
dressings are shown in table 4.4; the signs have been changed so that the 
regression coefficient will be positive. 

The regression coefficient is now obtained as 


(13)(23.55) + (8)(24.16) + (3)(23.23) - (7) (22.54) -- (17)(22.35) 
(3)[(13)^ + (8)2 + (3)^ (7)2 + (17)2] 


31.39 

1 ^ 


0.0180 


The factor (3) in the denominator is inserted to convert the treatment 
totals to means. The regression coefficient represents the average de- 
crease in strength for a unit increase on the coded scale. Since 5 imits 
on the coded scale correspond to 18 lb. KsOj tbe decrease in strength for 
each additional 18 lb. K 2 O is estimated as 0.090. 

The contribution of the regression to the sum of squares in the analysis 
of variance is (by rule 1, section 3.42) 


( 31 . 39)2 

~ 17 ^ 


= 0.5663 


as sho^vn in the analysis of variance. The contribution from the linear 
regression is significant at the 1% level. Since the mean square for 
deviations from the regression, 0.0554, is only slightly above the error 
m.s., there seems no point in investigating a quadratic regression, though 
the reader may care to verify that its contribution is small. 

The conclusion from the data is that increased dressings of potash 
produce a weaker fiber, the strength index declining by 0.090 for each 
18 Ib.-inerement in K 2 O. As an exercise we will obtain confidence limits 
for this rate of decline. We first require the estimated standard error for 
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4.23 


tlie 0.090 figure. This can be found by expressing the figure as a linear 
function of the treatment means yi mA applying rule 5a of section 3.51. 
We have 

0.090 = + 32/3 — Tfii ~ Wjsl 

so that the estimated standard ei’ror is 

Voj0437 6 




580 


\/0.0437 


V(i3)^8)" +7::+ (17)2 


0.0251 


s/mo 

Alternatively, this value could be derived by no ting t hat the square root 
the F~value for the linear regression, i.e., V" 12.96, or 3.6, is the ratio 
of 0.090 to its estimated standard error. Thus the latter must be 
0.09/3.6, or 0.0250. Finally, for the 80% confidence limits, we have 
(0.090 d= 0.025 X 1.397), or (0.090 ± 0.035), where 1.397 is the value of 
Hor a probability 0.20 and 8 di. 

4.24 Standard Errors. The estimated standard error of the difference 
between two treatment means is 


Sd = 


If some treatments i’eceive extra replication, the general formula is 




(-+-) 

Vi ^2/ 


In experiments where the error appears to be heterogeneous, a separate 
may be obtained for any pair or for any group of treatments. For 
this a new randomized blocks analysis is carried out on those treatments 
which belong to the group under consideration. 

Missing Data. The method of analysis when part of the data is 
missing is described by Yates (4.4). With a single missing unit, the first 
is to calculate a value for the unit by means of the formula 

tB tT — 0 

where B is the total of the remaining units in the block where the mwsin g 
unit appears, T is the total of the yields of this treatment in the other 
blocks, and G is the grand total; r and t are the numbers of replicates and 
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treatments respectively. The analysis of variance is then carried out as 
usual except that 1 d.f. is subtracted from the total s.s. and from the 
error s.s. 

The standard error of the difference between the mean of the treat- 
ment with a missing value and the mean of any other treatment is 



(4.2) 


When there are several missing values, for units a, h, d, • * * , we first 
guess values by inspection for alb units except a. Formula (4.1) is then 
used to find an approximation for a. With this approximation and the 
values previously assumed for c, d,; • * , we again use formula (4.1) to in- 
sert an approximation for 5. After a complete cycle of these operations, 
a second approximation is found for a and so on until the new approxi- 
mations are not matez'ially different from those found previously. The 
analysis of variance is then completed; for each missing unit, 1 d.f. is 
subtracted from the total and error s.s. 

Suppose that in the previous example two observations had been 
missing and that the data had appeared as follows. 


Replications 


Pounds K 2 O 

1 

2 

3 

Totals 

36 

a 

8.00 

7.93 

15.93 

54 

8.14 

8.15 

7.87 

24.16 

72 

7.76 

h 

7.74 

15.60 

108 

7.17 

7.57 

7.80 

22.54 

144 

7.46 

7.68 

7.21 

22.35 

Totals 

30.63 

31,40 

38.55 

100.48 


Since differences between replications are not pronounced, we might 
take as a trial valueTor a the mean of 8.00 and 7.93, or 7.96, For 
estimating bi, the first trial value of 6, by formula (4.1) we now have 
B = 31.40; T - 16.50; G - 100.48 + 7.96 = 108.44 
so that 

, (3)(3L40) + (5)(15.50) ^ 108.44 

:■ ■ hi = — — - ■ — (Ml 

8 

For estimating a 2 , the second trial value of a, we now^ have 


B - 30.53; T - 15.93; G - 100.48 + 7.91 - 108.39 
so that 

(3)(30.53) + (5)(15.93) - 108.39 


a2 — 


8 


7.86 
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Taking as 7.86, we find that the next trial value of & is 7.92. This is so 
close to the previous value that we may stop. Thus a = 7.86, 6 = / .92. 

The analysis of variance is now computed with these values inserted. 
There will be 12 d.f . in the total s.s., and 6 d.f. in the error s.s. The error 
m.s. is 0.049L 

To obtain a standard error for the comparison of two treatments, A 
and By we assign an “effective” number of replicates to each treatment. 
Ai\y replication of treatment A. is counted as 1 when both .4 and £ are 
present in the replicate, as Yz when is present but B is not, and as 0 
when A is missing. Tlie same rule is applied in scoring B. Suppose that 
in the example we are comparing the means of the 36- and 72-lb, dress- 
ings, The effective replication for each mea n is scored as I H- The 
standard error of their difference is taken as V + %), or a/i,333s^. 
If we are comparing the mean of the 36-lb. dressing with that of the 54- 
lb. dressing, which has no missing values, the score for the first mean is 
2, and that for the second 2}^ , so t hat the standard error of their dif- 
ference is 0^ a/ 0.9s^, 

This useful approximation is due to Yates; the exact formulae are 
laborious to compute. The approximation usually gives a standard 
error that is slightly too high. In the example the correct values for the 
standard errors of the two differences considered above are 1.0695 and 
0.9375, as compared with the approximate values of 1.1555 and 0.9495, 
respectively. 

As a result of the disturbance introduced by the missing units, the 
treatments m.s. is slightly too large; however, the va,riance-ratio test is 
unlikely to be much in error unless a substantial proportion of the units 
are missing. Yates (4.4) gives the method for obtaining an exact test. 

4.26 Estimation of Efficiency. If Ih and Ee are the block and error 
m.s. and n/,, and ric the block, treatment, and error degrees of freedom, 

UhEh +• + n^Ee 

- - ” - : ^ (4.3) 

~h Wj! + 

is an estimate of the error variance of a cornpletely randomized design 
with the same experimental material. Comparison of iJea*. E^, 
taking account of the change in numbers of error degrees of freedom 
(section 2.31), provides an estimate of the increase in accuracy which 
results from the grouping into replicates. 

The result in (4.3) may be proved in various ways. Since there is no 
complete discussion of results of this type in the literature, one method 
of proof will be sketched; the details, which are a matter of algebraic 
manipulation, will not be given completely. This proof uses the proper- 


101 


4.26 ESTIMATION OF EFFICIENCY 

ties of randomization. It is not the easiest proof, but requires very few 
assumptions. 

Let the experiment have t treatments and r replications, and let 
be the experimental error of the observation on the ith unit, and ry the 
effect of the jth treatment. No assumption is made about the nature 
or distribution of the errors. For this reason it is not necessary to 
introduce any specific symbol for the effect of the replication, since 
any type of effect can be represented by appropriate choice of the 
The treatment effect and the error are assumed to be additive; that is, 
if the randomization happens to put the jth treatment on the ith unit, 
their joint effect is (ry + Throughout the randomization the cy are 
regarded as a set of fixed numbers, each associated with a specific unit. 

Without loss of generality, we may assume that the total of all ob- 
servations is zero, and that the totals of the c’s and the r’s are both zero. 
Let rt t 

i=l 

Further, for all possible randomizations of the randomized blocks design 
the replications will of course remain the same; consequently, for any 
given batch of data the replication totals remain unchanged. Let R de- 
note the sum of squares of these totals, divided by L 

The main part of the proof consists in working out the average values 
of the mean squares in the analysis of variance, taken over all possible 
randomizations of each type of design. The results come out as shown 
in table 4.5. 


TABLE 4.5 Avebage values of mean squabes taken over the 

RANDOMIZATION SETS 

Completely randomized 



di. 

m.s. 

Treatments 

(f-1) 

rT S 

it -1) {rt - 1) 

Error 

t(r ~ 1) 

s 


Replications 

Treatments 

Error 


Randomized blocks 
di. ■ m.s. 


(r - 1) 

1 ) 

(f - l)(t - 1) 


(r - 1) 

(e - 1) r{t - 1) rit - 1) 
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This analysis leads at once to the result. Since 

rih === {t - 1); nt + Ue = T{t - I); 

it is seen from table 4.5 that for randomized blocks the average value 
of riffEb is R, while that of (rii + ne)Ee is (S — fl). It follows that the 
average value of 

uijEh + (% n-i^Ee + S R /S 

‘Ub '•f- ')lt 'h 'I’h 1) 0"^ 

But this is tlie average of the error m.s. for a completely randomized 
design on the same data. 

As an illustration of the details, consider the treatments m.s. for ran- 
domized blocks, which is probably the hardest term. The square of the 
first treatment total is 

Tx^ 5= (m + Cl + 62 + • • • + Gr)" 

When we average over all possible randomizations, there is no contribu- 
tion from terms in rei, since the treatment appears equally often on all 
units, and the total of the 6^s is zero. The average value of the contri- 
bution from the r terms in is the sum of the squares of all the a^s, 
divided by t, or S/ty since the treatment appears on any specific unit 
in a fraction 1/t of all randomizations. 

The contribution from terms in may be written 

H h 6 r) + 62(61 -f 63 -f • • * 6 r) 4 

+ Gr{ei + 62 4 h 6 r.-l) (4.4) 

The randomization is restricted so that every e comes from a different 
replication. If the subscript denotes the replicate, the mean value of 
CiCj is RiUj/i^y where Ri is the replication total of the e's. Consequently, 
the mean value of (4,4) is 

[i2i(i?2 4” H — ' + Rr) + R2iRi 4^ Ua + * * ' 4" Rr) 4- • * ■ 

+ RriRl +R2+---+ Rr-l)] = ~ I + i? 2 " + • • • + R/} = ~ ?- 

G t 

since the total of the Ri is zero, and since R will be recognized as the sum 
of squares of the Ri, divided by t. Hence 

E{T^^) = 

t t 

If this expression is summed for all t treatments and divided by r{t — 1) 
to give the treatments m.s., the result in table 4.5 follows. 
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4.3 Double Grouping: Latin Squares 

4.31 Description. In the latin square the treatm^^^ grouped in to 
replic ates in two different way s. Examples of latin squares are shorn 
for different numbers of treatments in plan 4.1 (p. 119). It will be noted 
that eve ry row and e ve ry column of any square is a complete replicat ion. 
The effect of the double grouping is to el im ina te from the errors all d if- 
ferences amon g rows and equally all differences am ong columns. Thus 
the latin s quare provides more opportunity than rando mized blocks for 
the re duction of errors bv skillful plann ing. 

The experimental material should be arranged and the experiment 
conducted so that the differences among rows and columns repreisent 
major sources of variation. Some ex ample s of the uses of latin squares 
in various fields of research may indicate the utility of the design. In 
field experiments the plots are usually laid out in a square formation, 
so that soil fertility and other variations in two directions are controlled. 
Variations along and across the greenhouse bench may be similarly 
handled in greenhouse experiments. Occasionally the latin square is 
advantageous even when the plots form a continuous line. In this case 
the row’s may be compact blocks of land Avhiie the columns specify the 
order within each block. If the yield gradient is suspected to be in the 
same direction ail along the line, blocks and order in blocks together re- 
move the effects of the gradient more thoroughly than a single control. 
An example of this type is shown below. 

TABLE 4,6 A latin squabk with thh plots in one continuous link 


Seven treatments A, R, C, D, E, Fy 0 

(Experimental limitations force the plot units to go at right angles to gradient.) 
Rep. ! n m IV V VI VII 



Main and Tippett (4.5) describe how 4X4 latin squares were em- : 

ployed in experiments on the weaving of cotton cloth. The purpose of a / 

series of experiments was to investigate the effect of the sizing treatment j 

applied to the wmrp. The criterion was the number of breaks in the warp 
during weaving. Pour warps, each with a different sizing treatment, 
w^ere woven simultaneously on 4 different looms, which could be super- ^ 

vised by a single weaver. Then each warp was moved to a different loom : 

of the set so that after 4 periods every sizing treatment had been tested \ 

on all 4 looms. 1 
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If A, B, C, D represent the 4 warps, the latin square used was as 
follows. 


Periods 

1. 

Looms 

2 

3 

4 

I 

A 

D 

B 

a 

11 

I) 

C 

A 

B 

III 

C 

B 

D 

. A 

lY" 

B 

■ : A - 

c ■ 

D 


This arrangement eliminates constant differences among the looms, 
which were found to be large, and also differeiK3es among the 4 periods 
of wearing. No “period differences were apparent — a result which 
might be anticipated, the authors suggest, because humidity and warp 
tension were conti’olled throughout the experiment. 

Fi^equently, particularly in industry, an experiment requires a series 
of operations each of which may introduce variability into the final re- 
sults. In such eases the latin square may be useful in a preliminarj^ in- 
vestigation of the sources of variation. For example, in the preparation 
of an explosive mixture used in primers, variation may occur either in 
the mixing of the ingredients of the explosive or in the process of charg- 
ing. One experiment of this type involved 4 mixing-blending teams and 
4 charging operators. On each day, the product of each team^was sent 
to a different charging operator, the arrangement being changed daily 
according to the following 4 X 4 latin square (letters IF, X, Y, and Z 
represent the mixing-blending teams). 


Charging operators 



1 

2 

3 

4 

Monday 

W 


X 

Y 

Tuesday 

X 

IF 

Y 

Z 

Wednesday 

y 

X 

Z 

w 

Thursday 

z 

Y 

w 

X 


The latin square analysis of variance enables us to isolate consistent dif- 
ferences amongst the teams and consistent differences amongst the 
chargers, as well as day-to-day variations. 

To quote a more complex example, a 12 X 12 latin square was used 
by Chen, Bliss, and Robbins (4.6) to estimate the toxicities of 12 poisons 
when applied to cats, A single replication of this experiment occupied 
a complete day, and the experiment was repeated for 12 days. Each 
drug was injected into the femoral vein of a cat at 1 cc. per minute until 
the cat died, and the measurement recorded was the dose required to 
cause death. An observer could administer only two of the drugs at a 
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given time. Consequently, three observers v^ere used, each treating 
two cats in the morning and two in the afternoon. The rows of the latin 
square were used to eliminate systematic differences between obseiwers 
and between the morning and afternoon injections. The columns i-epre- 
sent days. The experimental plan was as follows {letters represent 
drugs). 


Time 

Observer 

Day 

j 

1 

2 

3 


5 

I 

7. 

• 8 

9 

10 

U 

12 

10:30 A.M. 

I 

I 

J 

B 

L 

H 

G 

F 

K 

D 

E 

A 

1 C 



K 

a 

J 

H 

I 

B 

L 

C 

E 

F 

D 

i it" 


II 

B 

L 

G 

a 

D 

J 

K 

E 

H 

A 

\ ^ 

, ■ 

1 



E 

D 

F 

G 

J 

K 

A 

L 

^ C 

I 


H 

, 

III 

C 

K 

A 

B 

F 

L 

I 

D 

' G ■ 

H 

,■/ ' 1 

B 



F 

E 

K 

B 

G 

C 

D 

B 


B 

I ;J 

J 

2:30 p.M. 

I 

J 

a ■] 

:E"\ 

K 

A 

1 

•H \ 

F 

B;- 

G 

L 

D 



D 


I ^ 


L 


:c'\ 

G 


:B. ] 

E 

K 


11 

A 

■B"\ 

C ' 


E 

F 

G ! 

H 

1 

J 

K 

L 



H 

E 

L . 

J 

" 1 

.A , 

B , 

I 

K 

D 

G 

F 
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G 

■ I ' ■■ 

D , 

■f' 

K , 

II \ 

J 

■ A'. 

■ E'-. 

C 

E 

B 



L 

.4 

H 

■J:: 

B 

D 

E 

/ 

F 

K 

C 

G 


During the course of the experiment every drug was given equally 
often by each observer and equally often in the morning and afternoon. 
The arrangement is a good example of the way in which a single grouping 
(rows) can control simultaneously more than one potential source of 
variation. All three major variables— day, time of day, and observer — 
would have inflated the experimental error if left imcontrolled. 

4.32 Number of Replications. The chief restriction on the utility of 
the latin square is that the number of replicates equals the number of 
treatments; if the latter is considerable, the number of replications re- 
quired becomes impractical. Squares larger than 12 X 12 are seldom 
used, while the most common range is from the 5 X 5 to the 8 X 8 
square. Latin squares also suffer to some extent from the same dis- 
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advantage as randomized blocks in that the experimental error per unit 
is likely to increase with the size of square; 

The small squares provide only' 

estimation of mToVr-nono with the 2X2, two with the 3 X 3, and six 
witliThe 4 X 4. This fact precludes the use of single 2 X 2 squares, 
while the 3 X 3 and 4 X 4 squares must produce a substantial reduction 
in error over randomized blocks or complete randomization to counter- 
balance the loss of degrees of freedom. More than one square may, 
however, be included in the same experiment. Three 3X3 squares 
(9 replicates} furnish 10 error d.f., while two 4X4 squares give 15 d.f., 
provided that in each case the ^^squares X treatments” interaction can 
bo pooled with error. A considerable number of 2 X 2 squares would be 
required, since the error degrees of freedom are 1 less than the number of 
squares. 

4.33 Randomization, A complete representation of the squares from 
4 X 4 to 6 X 0 and sample squares up to the 12 X 12 is given by Fisher 
and Yates (4.3). For squares up to the 6X6, the randomization pro- 
cedure given in this reference selects (with a minimum of labor) a square 
at random from all latin squares of a given size. For a discussion of the 
theoretical basis of the randomization, see Yates (4.7), 

Examples of latin squares are shown in plan 4.1 (p. 119). The method 
of randomization for plan 4.1 is as follows. 

S XS, Arrange the columns at random and the last 2 rows at random. 
4X4- Select at random one of the 4 squares. Arrange at random all 
columns and the last 3 rows. It is equally good, though not strictly 
necessary, to randomize all rows and columns. 

a X -5' mid higher squares. Arrange all rows, columns, and treatments 
independently at random. ^ 

For 3X3 and 4X4 squares, this procedure selects one square at 
random from all possible squares. For 6 X 5 and larger squares, some 
types of squares have no chance of being selected if plan 4.1 is used. 
Unless latin squares are used very frequently, however, the randomiza- 
tion sets are sufficiently large for experimental plans. 

4.34 Statistical Analysis, Several experiments have demonstrated 
that people find it difficult to select, by personal judgment, unbiased 
samples even from relatively small populations that can be thoroughly 
inspected before selection. In this experiment, each population con- 
sisted of a small area of wheat, containing about 80 shoots, the shoots 
being slightly over 2 feet high. There were 12 samplers, all experienced 
in studying the growth of wheat. Each sampler inspected each area and 
measured the heights of 8 shoots whose heights were to give a represent- 
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ative sample of the shoot heights in the area. The quantity that will be 
analyzed is the difference between the mean height of the 8 selected 
shoots and the true mean height in the corresponding area, in other 
words the sampler’s error. 

The samplers were divided into two groups of six, of which we will 
consider only the first group. The samplers represent the experimental 
treatments. There were 6 areas, each sampled by all 6 samplers in the 
group. These areas, which serve as the replications, may be taken as the 
columns of the 6X6 latin square. The rows of the square prescribed 
the order in which each man sampled the 6 areas (see table 4.7). This 


TABLE 4.7 Sampleb’s errors in shoot heights (centimeters) 

6X6 LATIN SQUARE 


Order 

Archas 

Totals 

! ■■ 1 . 

2 

3 

4 

5 

6 

I 

F+ 3.5 

B+ 4.2 

A +6.7 

D+6.6 

0+4.1 

jEI'+s.s 

+28.9 

II 

B+ 8.9 

1.9 i 

£>+5.8- 

A +4.5 

B+2.4 

0+5.8 ! 

+29.3 

in 

C+ 9.6 

E-\- 3.7 

F-2.7 

B+3.7 

I) +6.0 

A +7.0 

+27.3 

IV 

i)+10.5 

C+10.2 

B+4.6 

B+3.7 

A +6.1 

B+3.8 

+37.9 

V: ■ 

3.1 

A+ 7.2 

0+4.0 

B-3.3 

B+3.5 

D+5.0 

+19.5 

VI 

A+ 5.9 

i)+ 7.6 

B~-0.7 

0+3.0 

F+4.0 

J3+8.6 

+28.4 

Totals 

+41.5 

+34.8 

+17.7 

+18.2 

+25.1 

+34.0 

+171,3 = 


Totals for samplers 

" A T \ ■ 

+36.4 +33.5 ' +36.7 +41.5 +16.0 +7.2 


feature served two purposes. It helped to prevent the men from getting 
in each other’s way, since no two men worked on the same area at the 
same time, and it permitted an examination of the effects of the order of 
sampling on the errors. 

Since only 3 of the 36 errors in table 4.7 are negative, there is evidence 
of a rather consistent tendency towards overestiination of the shoot 
heights. For simplicity, the observations will be referred to as measures 
of bias, though of course they contain a component due to chance 
fluctuations. There is also an indication, on casual inspection of the 
table, that the bias varies from one sampler to another. This point will 
be tested in the analysis of variance. The computations are as follows. 
First find the totals for each row, column, and latin letter and the grand 
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total, 0, shown in table 4.7. The following srans of squares are then 
calculated. 

(171.3)2 


Correction faetor: C ■ 


- 815.10 


36 

Total: (3.5)- -f (4.2)“ -f . • • + (8.6)“ - C = 1144.73 - 815.10 
liows: i[(28.0)“ + . . . + (28.4)“] -- (7 - 843.70 - 815.10 
Columns: ^.[(41.5)*^ + • ‘ + (34.0)2] ^ ^ ^ 893.97 815.10 

Treatments: ^[(36.4)2 + • * • + (7.2)2] 970.70 -- 815.10 


329.63 

28.60 

78.87 

155.60 


The divisor 6 is replaced by r for an r X t square. 

These sums of squares are entered in the analysis of variance. The 
error s.s. is found by subtraction. 

TABLE 4.8 Analysis of variance i?OR a G X 6 latin square 


Source of variation 


d.f. 


s.g 

i. 

m.s. 

F 

Rows (order of sampling) 

(r 

-1) 

- 5 

28. 

.60 

5.720 

1.72 

Columns (areas) 

{r 

- 1) 

- 5 

78, 

.87 

15.774 

4.74 ** 

Treatments (samplers) 

ir 

-1) 

- 5 

155, 

.60 

31.120 

9.35 ** 

Error (r 

- l)(r - 2) 

- 20 

66, 

.56 

3.328 


Total 


-1) 

- 35 

329, 

.63 




To obtain the mean squares, each sum of squares is divided by the 
corresponding number of degrees of freedom. The F value for samplers 
3L120/3.328, or 9.35, well beyond the 1% level, which is 4,10 for 
5 and 20 d.f. This shows that the extent of the overestimation varies 
from one sampler to another. The means for tlie samplers are given 
below. 


A 

4-6.07 


B 

4-5.58 


C 

46.12 


D 

46.92 


■ E 

42.67 


F 

41/20 


The estimated standard orror^of e^h mean is VsVr, where is the 
error m.s., or in this case V (3.328)/6 = 0,745. For testing the dif- 
ference between a pair of means, the standard error is \/2 (0.745) ~ 
1.053. Since the 5% t value for 20 d.f. is 2.086, the difference between 
two means must be at least (2.086) (1 .053) , or 2.20, in order to attain 
significance at this level It appears that the samplers fall into two sets — 
Bj Cj D and F, the biases being significantly smaller for E and F 
than for the others. 

Some further information about the nature of the biases can be ob- 
tained from an examination of the row, and column totals. From table 




f 

r: 


I- 
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4.8 the F value for rows is seen to be 1.72, which is below the 5% value, 
2.71* No clear effect of the order in which the areas were sampled has 
therefore been established; further, the row totals (table 4,7) do not sug- 
gest any consistent trend. 

On the other hand, the F value for the columns m.s., 4.74, is significant 
at the 1% level, so that the bias varies from area to area. Previous 
observations had indicated that the bias might depend on the mean shoot 
height of the area. The column totals are ranked below in order of in- 
creasing true mean shoot height. 



Trae mean 

Column 


shoot height 

total 

Area 

(centimeters) 

(centimeters) 

1 

59.0 

+41.5 

2 

66.2 

+34.8 

6 

72.3 

+34.0 

4 

74.5 

+18.2 

6 

76,0 

+25.1 

3 

76.4 

+17.7 


Apparently on areas where the shoots are not so high the positive bias 
is greater tlian on areas with relatively high shoots. The result is not 
surprising, since the sampler had to stoop in order to do his work. The 
linear regression of the column totals on the true mean heights contrib- 
utes 59.88 to the sum of squares for columns, 78,87. The mean square 
for deviations from this regression (4 d.f.) is only 4.75, which is not sig- 
nificantly greater than the error m.s. The inference is that the large 
value of the mean square for columns is due mainly to the negative 
correlation between the bias and the mean height. For further discussion 
of this experiment, see (4.8). 

The analysis of two 4X4 squares is discussed in (4.9). A numerical 
example with five 3X3 squares will be found in (4.10). It should be 
noted that with several latin squares the interaction of treatments with 
squares may be tested. If there is no reason to expect that this inter- 
action is real, the corresponding sum of squares may be combined with 
error, 

■ 12^2 ... 

4,36 Standard Errors. The formula — gives the estimated standard 


error of the difference between two treatment means, where is the 
mean square per unit and r the number of replicates. In cases where 
heterogeneity of errors is suspected, the error cannot be vsubdivided mth 
the same ease as in randomized blocks. It is possible to remove from the 
error s.s. the contribution of any one of the treatments^ Yates (4.11). For 



y = 


t{ r + c + r) - 20 

(r - l)(r - 2) 


where R,G^T, and 0 are respectively the totals of the row, column, and 
treatment which contain the missing value and the grand total The 
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dj ““ Ri “j"* Gi ‘^Vi 

where Ri and Ci are the corresponding row and column totals, and r is 
the number of treat ments. The sum of squares of deviations of the 
quantities dj, dividixl by r(r •— 2), is the contribution of this treatment 
to the error s.s. and has (r — 1) degrees of freedom. For illustration, 
siqipose that we wish to calculate the contribution of sampler F to the 
error in section 4.34. From table 4.7 the first d TOlue is 

di - (+28.9) + (+41.5) - 6(+3.5)+ +49.4 

and the remaining values will be found to be +52.7, +61.2, +57.5, 
+29.5, +49.1, with a total +299.4. The sum of squares of deviations 
the. mean is 610.34. Divided by 24, this gives 25.43, with 5 d.f., 
for the contribution of sampler F to the error s.s. Consequently the 
error s.s. (table 4.8) may be divided as shown below. 



di. 

s.s. 

m.s. 

Total error 

20 

66.56 

3.328 

Contribution of F 

5 

25.43 

5,086 

Eomainder 

15 

41.13 

2.742 


The error m.s. for the suspected treatment may be compared with 
mean square for the remaining (r — l) (r+- 3) degrees of freedom, 
to test whether the treatment shows abnormally high variation. In 
this example the variance ratio is 1.85, with 5 and 15 di. If it is con- 
cluded that the treatment is unusually variable, the mean square with 
(r — 1) (r — 3) degrcies of freedom is used as error for the other treat- 
ments. It should be stressed that the ordinary variance-ratio tables can. 

used in this test only if the decision to make the test was taken before 
examining the individual results. If the test is made merely because one 
treatment looks anomaloius on inspection, different significance levels 
that take account of this fact are needed, as discussed in section 3,53. 

4.36 Missing Values. Except for changes in the formulae, the pro- 
cedure is similar to that foi* randomized blocks (section 4.25). A single 
missing value is substituted by means of the formula, reference (4.4) 
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each yield yi of the treatment, calculate the quantity 
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estimated standard error of the difference between the corresponding 
treatment mean and the mean of a treatment with no missing values is 

Allien several values are missing, repeated application is made of for- 
mula (4.5) as described in reference (4.4). When values have been in- 
serted for all missing data, the usual analysis of variance is calculated. 
One degree of freedom is subtracted from the error d.f. for each missing 
value. 

The exact formula for the standard error of a treatment mean is eom- 
jolex. Yates (4.4) proposed a useful approximate rule, which shows the 
number of replicates to be assigned to any treatment mean for a com- 
parison with another treatment mean. Each observation on one treat- 
ment is given one replication if the other treatment is present in both 
the corresponding row and column. The replication is % when the 
other treatment is missing in the row or column and is when the other 
treatment is missing in both the row and column. When the treatment 
itself is missing, the replication is 0. Although complicated at first sight, 
the rule is not difficult to apply and is reasonably accurate. Consider for 
instance the following 6 X 6 square with 3 units missing (two B’s and 
one AJ) as indicated by the parentheses ( ). 


B 

E 

C 

F 

D 

A 

F 

D 

(B) 

E 

A 

C 

C 

•A . ' 

E 

D 

B 

F 

m 

C 

A 

(B) 

F 

D 

A 

F 

D 

C 

E 

B 

D 

B 

F 

A 

C 

E 


Taking the yields of B and B in the order of the rows, we ascribe the 
following numbers of replicates. 

B: "I- h 0 4“ 1 h 0 ■+• 1 ~h 1 == 3-^ = 

1 h h ‘3' h 0 h 1 h 

Hence the standard error of the difference between the treatment means 
is a/ F or the difference between A a nd B, the reader may 
verify that the standard error is V + x)* Notice that the ef- 
fective replication for a treatment may change from one comparison to 
another. 

The analysis required when a single row, column, or treatment is miss- 
ing is described by Yates (4.11). Wien more than one row, column, or 
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treatment is missing, consult Yates and Hale (4.12). These methods 
are also presented by DeLury (4.13). 

4.37 Estimation of Efficiency. The effectiyeness of either the row or 
the coiiimn grouping may be tested ironi the results of latin SQiiaies. 
The expression 

4~ 4" lie 

is an estimate of the error m.s. which would have been obtained if the 
row grouping had not been used, i.e., if the design had been randomized 
Idocks with the columns as blocks. In the formula, Er and Ee are the 
mean scjuarevS for rows and error, respectively, and 7irj iio He are the num- 
bers of degrees of freedom for rows, treatments, and error. The in- 
creased munber of degrees of freedom with randomized blocks is taken 
into account by the method of section 2.31. The effectiveness of the 
colamn grouping is tested similarly. 

A 5X5 experiment on potatoes will serve as an illustration. Tlie 
mean .squares for the yields of potatoes were as follows. 


Rows 

Columns 

Treatments 

Error 


di. 

4 

4 

4 

12 


m.s, 

62.6 

104.7 

272.0 

26.2 


If the experiment had been conducted in randomized blocks with the 
columns as blocks, the estimated error m.s, would be 

4(62.6) 4- 16(26.2) 


20 


33.5 


The degrees of freedom available for error would increase from 12 to 16. 
To take account of the advantage of the additional degrees of freedom 
(section 2.31), we reduce 33.5 by the multiplier (13)(19)/(17)(15), or 
0.969. This gives 32.5 as a comparable mean square for randomized 
blocks. The row grouping increased the information by an estimated 
25%. Of course a considerable number of these comparisons would be 
needed for drawing general conclusions. 


4.4 Cross-over Designs 

4.41 Description. In dairy husbandry and biological assay a design 
has been used which closely resembles the latin square but may have 
some advantages when the number of treatments is small. ^ In the sim- 
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plest case where there are two treatments A and the units are first 
grouped into pairs as if a randomized block design were to be used. 
Suppose that from previous knowledge one unit in each pair is expected 
to give a higher response than the other and that the difference in favor 
of the superior unit is expected to be about the same in all pairs. It will 
clearly be advisable to ensure that each treatment is applied to the 
“better^^ member in half the replicates and to the “poorer^ member in 
the other half. The pairs or replicates, which must be even in number, 
are divided at random hiio two equal sets, the firvst set to receive treat- 
ment A on the superior member of each pair, the second to receive 

The experiment discussed in chapter 1 provides an example. The 
data (shown in table 4.10) are the times required to compute the sums 
of squares of 27 observations on each of two machines whose speeds it was 
desired to compare. Ten sums of squares were calculated, making 10 
replications of the experiment. The cross-over design was used because 
it was thought that the second computation of a sum of squares might 
be faster than the first, so that for a fair comparison each machine should 
be used first in five of the ten replications. The randomization assigned 
machine A to be used fiivst in replications 1, 3, 6, 8, and 9. 

The degrees of freedom in the analysis of variance are as follows, 

■ d.f. ■ 


Columns (pairs or replicates) 9 

Rows 1 

Treatments 1 

Error 8 


Total 19 

Note that only 1 d.f. is assigned to rows. That is, the arrangement re- 
moves from error only the average difference between the 2 rows. In so 
far as the difference is not constant but varies from one replication to 
another, this variation enters into the experimental error. 

In some experiments it is known that this difference will vaiy, and it 
is possible to estimate in advance (at least roughly) whether the difr- 
ference will be large or small for a given replicate. Suppose that the dif- 
ference is expected to decrease steadily from replicate 1 to replicate 10. 
With this knowledge a more accurate design is obtained by the use of 
five 2X2 lathi squares, as shown below. 


TABLE 4.9 Five 2X2 i.atin bquabes for 2 treatments in 10 replicates , 


Rows I II 

12 3 4 

Better A B A B 

Poorer BA BA 


Square 

III IV 

5 6 7 8 

BA A B 

A B BA 


V 

9 10 

A B 

B iM'A 
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In this arrangement we remove from error 5 d.f. for rows, one in each 
sciuare. Thus not only the average difference is removed, but also the 
variation in tlie difference from square to square. 1 he combined anal^^sis 
of variance of the 5 squares is set out as follows. 


Squares 

Colimms within squares 
Rows within, squares 
Treatments 
, Error . 

Total 19 

The 9 d.f. for squares and columns within squares are exactly the same 
as the 9 d.f. for columns in the cross-over design. Owing to the more 
complete elimination of the row effects, there are only 4 d.f. for error 
instead of 8 with the cross-over plan. 

To summariige, the cross-over design is particularly appropriate v/hen 
the difference betw^een the rows is substantially the same in all replicates, 
for in this case the whole of the real difference between rows is con- 
centrated in the single degree of freedom and the error variance is no 
larger than that of the latin squares. Even if the difference bet\veen rows 
is known to be variable, the cross-over design may be preferable in small 
experiments where few degrees of freedom are available for error. 

Tn dairy husbandry the cross-over may be used to compare the effects 
of two feeding rations on the amount and quality of milk produced by 
the cow. Since cows vary greatly in their milk production, each ration 
is tested on every cow by feeding it during either the first or the second 
half of the period of lactation, so that each co%v gives a separate replicate. 
The milk yield of a cow declines sharply from the first to the second half 
of its period, so that the first half is ahvays ^ ^better,” in the sense above. 
Whether a cross-over is superior to a set of latin squares depends on 
circumstances. The rate of decline is not constant from cow to cow; it is 
greater in general for high-yielding than for low-yielding cows. Thus if 
previous production records are available, the cows may be divided into 
pairs on the basis of yielding ability, each pair being made a separate 
2X2 latin square. This plan is likely to give a smaller error than the 
cross-over, though sometimes not sufficiently smaller to counterbalance 
the loss of degrees of freedom. 

The design can be used with any number of treatments, subject to the 
restriction that the number of replicates must be a multiple of the num- 
ber of treatments. With three treatments, for example, a plan can be 


d.f. 

4 

5 
5- 
1 
4 
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drawn up from the 3 cycles ABC^ EGA ^ CAB^ where the order of the 
letters denotes the row to which each treatment is applied. Each cycle is 
allotted a.t random to one-third of the replicates. For higher numbers of 
treatments a design is constructed in the same way from the columns of 
any latin square. When the number of treatments exceeds four, how- 
ever, the degrees of freedom for error are sufficiently large so that a set 
of latin squares is usually preferable. 


4.42 Statistical Analysis. All sums of squares are calculated in the 
usual way. For the example (table 4.10), the computations are as fol- 
lows. 


Correction factor: 


( 352)2 

20 

3 -4- I 


Total: (30)2 ^ ( 14)2 (21)^ + . . . -f (23)' 


Columns: |•[(44)2 + (42)^ -)- 
(194 - 158)2 


•+( 47 ) 2 ] 


Bows: - 


Treatments: 


20 

(216 - 136)2 
20 


« 6195.2 


*- 6195.2 - 910.8 
6195.2 - 357.8 

64.8 
= 320.0 


TABLE 4.10 Ckoss-ovee experiment for comparing the speeds of two 

COMPUTING MACHINES A : AND B 


Time (seconds minus 2 minutes) taken to calculate a sum of squares 


Rows 

Columns (replications) 

Totals 

I- 

2 

1 ■ 3'"; 

4 

5 

' 6 + 

7 

8 

9 

10 

First (poorer) 

A30 

B21 

A22 

ma 

RIB 

A29 

B 7 

A12 

423 

J924 

IM 

Second (bet- 












ter) 

R14 

A21 

B 5 

A22 

A18 

B17 

+16 

B14 

B S| 

"A2Z- 

,158^ 

Totals 

44 

42 

27 

35 

31 

; ■■ ^ 

23 

26 

I -31 

47 



Treatment totals: A 216; B 136 


Analysis of variance 


Source of variation 

d.f. 

s.s. 

m.s. 

Columns (replications) 

9 

357.8 

39.8 

Rows (first versus second) 

1 

64.8 

64.8 

Treatments 

1 

320.0 

320.0 

Error 

8 

168.2 

21.0 

Total 

19 

wo. 8 




The error s.s, is found by siibtraetioin ine p-muo lor 
significant at the .1% level 

We will calculate the confidence limits for the true difiei’ence in speed 
which were quoted in section 1.2. The average observed difference is 
8.0 seconds in favor of machine B. The estimated standard error of this 
difference is A/2(¥u3f^^ based on 8 d.f. Consequently the 

confidence limits for the true difference are 

8.0 d: 2.049^^ 

"where is the value of Student^s f corresponding to (1 — P) , "where P is 
the confidence probability chosen. For P = .95, f — 2.306, and the 
limits are (8.0 ± 4.7), or 3.3 and 12.7. For P = .80, f - 1.397, and, 
for P === .99, f = 3.356, giving confidence limits of (5.1, 10.9) and (1.1, 
14.9), respectively, as quoted. 

Worked examples for the case where there are two treatments are also 
given by Brandt (4.14) and Fieller (4.15); the latter also illustrates the 
analysis of covariance for this design and the procedure when one column 
is missing. 

In general, if there are i treatments and r replicates, the degrees of 
freedom subdivide as follows. 



d.f. 

Columns 

(r- 1) 

Hows 

(t - 3) 

Treatments 

(f-1) 

Error 

(t - l)(r - 2) 

Total 

(tr - 1) 


4.43 Standard Errors. The usual formula 


holds for the standard 


error of the difference between two treatment means. 
The substitution formula for a missing value is 


V =- 


a-l)(r - 2) 


(4.7) 


where the capital letters refer respectively to the totals of the column, 
row, and treatment in which the missing value appears and to the grand 
total The difference between the mean of the affected treatment and 
the mean of a treatment with no missing value has a standard error 


(4.8) 


4,51 
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4.6 Triple Grouping : Graeco-latin Squares 

4.61 Description. In this arrangement the treatments are grouped 
into replicates in three different ways with the consequence that the ef- 
fects of three different sources of variation are equalized for all treat- 
ments. 

TABLE 4.11 A 5 X 5 GEAEGO-LATIN SQUARE 


Litters 

a 

Pea (weight groups) 
h c d 

e 

I 

Ai 

Ss 

C5 

T>2 

Ei 

11 

B2 

Ci 

£>1 


A. 5 

in 

Cs 

D, 

^2 

Ai 

Bi 

IV 

Di 

El 

As 


Cs 

' i 

Ef, 

As 

B.1 

Cl 

A 


The additional grouping (usually represented by greek letters) is de- 
noted here by subscripts. Each treatment (A, -- £") appears once in 

each row and colimin and once with each subscript (1 v- *5). 

One example is an arrangement proposed by Dunlop (4,16) for 
testing 5 feeding treatments on pigs. The arrangement requires 5 pigs 
from each of 6 litters (I, 11, HI, IV, and V), the effects of litter dif- 
ferences being equalized by making litters correspond to the rows of the 
square. The animals are housed in 5 pens (a, b, o, rf, and e) which con- 
stitute the columns. The columns are used also to control differences in 
weight; i.e., the heaviest animal in each litter is assigned to the first pen, 
the next heaviest to the second pen, and so on; consequently the columns 
eliminate simultaneously the pen differences and the principal variations 
in w^eight within litters. Thus far the design is an ordinary latin 
square. 

The subscripts signify the positions of the five feeding crates within 
each pen. This extra control, though possibly unnecessary, ^vas sug- 
gested for several reasons. The first and fifth crates in each pen were of 
different construction from the other crates; moreover the pigs occupying 
the end crates have less “company” when feeding. 

The graeco-latin square has not been used often, probably because the 
units can seldom be balanced conveniently in all three groupings. In the 
example discusKsed above, the practical difficulty was to secure five litters 
of which each contained five or more pigs. An interesting industrial ap- 
plication is described by Tippett (4.17). ^ 
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Tlie designs have been constructed for all numbers of treatments from 
3 to 12, except 6 and 10; examples are shown in plan 4.2 (p. 120). 

4J2 Randomization. Arrange the rows and columns independent^ 
at random. Assign the latin letters and the subscripts at random to 
their respective classifications. 

4.53 Statistical Analysis. The sums of squares for rows, columns, 
treatments, and suliscripts are all obtained in the usual way. For an 
r X r sciuare the error has (’r — l)(r — 3) degrees of freedom. This 
number is rather inadequate when r is less than 6. 
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Plan 4-i {Continual) Selected latin squiires 
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Plan 4.S Graeco-lathi squares 
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Plan {Contmmed) Graeco-latiii squares 
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CHAPTEE, 5 


FACTORIAL EXPERIMENTS 
6.1 Description 

6.11 A 2^ Factorial Experiment. In a factorial experiment the effects 
of a number of different factors are investigated simultaneously. The 
treatments consist of all combinations that can be formed from the dif- 
ferent factors. To illustrate the simplest case, consider an experiment 
on sugar beet witli 2 factors. These were nitrogen, none (no) versus 3 
cwt. sulphate of ammonia per acre (ni), and depth of winter ploughing 
(7 in. versus 11 in.). Ploughing took place in late January, the nitrogen 
was applied in late April, and the seed was Ksown early in May. Since 
both factors occur at 2 levels or variations, the experiment is described 
as a 2 X 2 factorial experiment. The 4 treatment combinations are 
shora below, with the mean yields of sugar per acre (cwt.) underneath. 

Treatment combinations and yields of sugar (cwt. per acre) 

I 2 3 4 

(/lo, 7 in.) (ni, 7 in.) (no, 11 in.) (ni, 11 in.) 

40.9 47.8 42.4 50.2 

The 3delds may be placed in the following 2X2 table. 


. ■ , , ^ ^ ."j 

Depth 

Nitrogen 
no ni 

Mean 

liespoiivse 

tom 

Tin. 

40.9 

47.8 

44.4 

+6.9 

11 in. 

42.4 

50.2 

46.3 

+7.8 

Mean 

41.6 

49,0 

45.3 


11 in. minus 7 in. 

+1.5 

+2.4 




The results might be summarized as follows. Considering the effect 
of nitrogen, we might report that the application of nitrogen increased 
yields by 6.9 cwt. with shallow ploughing and by 7.8 cwt. with deep 
ploughing. These figures are called the simple effects of nitrogen. They 
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represent the type of information that would be wanted, for instance, 
in giving advice to a farmer -who always used shallow ploughing but Avas 
doubtful whether to apply nitrogen. For the simple effects of depth of 
ploughing, we might report that 11 in. ploughing was superior to 7 in. 
by 1.5 cwt. in the absence of nitrogen and by 2.4 cwt. when nitrogen was 
applied. 

There is another way of looking at the results. It sometimes happens 
that the effects of the factors are independent By this we mean that the 
response to nitrogen is the same whether ploughing is shallow or deep, 
and that the difference between the effects of deep and shallow ploughing 
is the same whether nitrogen is present or not. In this event the two sim- 
ple effects of nitrogen, 6.9 cwt. and 7.8 cwt., are estimates of the same 
quantity and differ only by experimental errors. On this supposition we 
would naturally average the two figures in order to estimate the response 
to nitrogen. The average, 7.4 cwt., is called the main effect of nitrogen. 
It can be derived alternatively as the difference between the two column 
means in the table, 41.6 and 49.0. Similarly the main effect of depth of 
ploughing (11 in. minus 7 in.) is the average of 1.5 cwt. and 2.4 cwt., or 
1.9 cwt. Note that a main effect, being an average of the simple effects, 
is more precisely estimated than the latter. In this experiment the 
standard error of a main effect is l/^/2 times that of a simple effect. 

Consequent^, if we are sure that the factors operate independently, 
the summary that was given above in terms of simple effects may be 
replaced by another that is both more concise and more accurate. This 
might read as follows. ^ 'The application of nitrogen increased the yield 
of sugar by 7.4 cwt., while 11 in. ploughing increased the yield by 1.9 
cwt. as compared with 7 in. ploughing.” It is worth repeating that when 
the factors are independent the figure 7.4 cwt. is the best estimate not 
onl}’' of the. average response to nitrogen, but also of the response on plots 
ploughed to 7 in. and of that on plots ploughed to 11 in. In other words, 
the whole of the information in the experiment is contained in, the main 
effects. 

The question arises: How do we know whether the factors are inde- 
pendent? Frequently the answer is suggested by knowledge of the 
processes by which the factors produce their effects. In the present case 
an agronomist might reason that deep ploughing should enable tlie plant 
to develop a more vigorous root system. With this the plant shoidd be 
able to utilize more effectively any added nutrient such as nitrogen. 
Thus he might predict that the response to nitrogen would be greater 
with deep than with shallow ploughing, though he probably would not 
expect it to be much greater. In short, he would predict that the two 
factors would not be quite independent in their effects. 
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III addition to the information that may be available from such 
reasoning, a factorial experiment itself provides a test of the assumption 
of independence. For, if the depth of ploughing does affect the response 
to nitrogen, the difference between 7.S cvd. (the response to nitrogen 
with deep ploughing) and 6.9 cwt. (the response with shalloAV plough- 
ing) is an estimate of this effect. The difference, 0.9 cwt., can be tested 
in the usual way by a t-test If it proves significant, the assumption of 
independence is rejected by the data. The difference (sometimes divided 
by a numerical factor) is called the interaction between nitrogen, and 
depth of ploughing, 

Iiiterchanging the roles of the two factors, we may also consider 
whether the superiority of deep over shallow ploughing is affected by the 
presence of nitrogen. To measure the interaction in this case, we sub- 
tract 1,5 cwt. (superiority of deep ploughing Avhen no nitrogen is added) 
from 2.4 c’wt. (superiority when nitrogen is added). The difference is 
again 0.9 cwt. It is easy to see that this equality always holds with a 
2 X 2 experiment. Each difference is equal to the sum of the observa- 
tions in one diagonal of the 2 X 2 table, minus the corresponding sum 
for the other diagonal. Such interactions are called two-factor , or first- 
order, interactions. 

5*12 Advantages of Factorial Experimentation When Factors Are 
Independent* The advantages of factorial experimentation naturally 
depend on the purpose of the experiment. We suppose for the present 
that the purpose is to investigate the effects of each factor over some pre- 
assigned range that is covered by the levels of that factor which are used 
in the experiment. In other words the object is to obtain a broad picture 
of the effects of the fac-tors rather than to find, say, the combination of 
the levels of the factors that give a maximum response. One procedure 
is to conduct separate experiments each of which deals only with a 
single factor. Another is to include all factors simultaneously hy means 
of a factorial experiment. 

If all factors are independent in their effects, the factorial approach 
will result in a considerable saving of the time and material devoted to 
the experiments. The saving follows from two facts. First, as we have 
seen, when factors are independent all the simple effects of a factor are 
equal to its main effect, so that main effects are the only quantities needed 
to describe fully the consequences of variations in the factor. Secondl3q 
in a factorial experiment, each main effect is estimated -with the same 
precision as if the whole experiment had been devoted to that factor 
alone. Thus, in the preceding example, half the plots receive nitrogen 
and half do not. Consequently, the main effect of nitrogen is estimated 
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just as precisely as it would be in a simple experiment of the same size 
devoted to nitrogen alone. The same result holds for the effect of depth 
of ploughing. The two single-factor experiments would require twice the 
total number of plots in order to equal the precision obtained by the 
factorial experiment. If there were n factors, all at two levels and all 
independent, the single-factor approach would necessitate n times as 
much experimental material as a factorial arrangement of equal precision. 
The gain from factorial aiTangements m this case is very substantial. 

Practical considerations may diminish this gain. The experimenter 
frequently lacks the resources to conduct a large experiment with many 
treatments, and must proceed with only one or two factors at a time. 
Further, it has been pointed out previously that, as the number of treat- 
ment combinations in an experiment is enlarged, the standard error pei: 
unit increases. This standard error is therefore likely to be higher for a 
large factorial experiment than for a comparable single-factor experi- 
ment. This increase in standard errors can usually be kept small by the 
device known as confounding, described in chapter 6. 

5.13 Factorial Experimentation When Factors Are Not Independent, 
We assume that the purpose is still to investigate each factor over the 
range represented by its levels. When factors are not independent, the 
simple effects of a factor vary according to the particular combination 
of the other factors with which these are produced. In this case the 
single-factor approach is likely to provide only a number of discon- 
nected pieces of information that cannot easily be put together. In 
order to conduct an experiment on a single factor A, some decision must 
be made about the levels of other factors B, C, B, say, that are to be 
used in the experiment (e.g., whether all plots should be ploughed 7 in., 
9 in., or 11 in. deep m an experiment on nitrogen). The experiment re- 
veals the effects of d. for this particular combination of B, C, ancM), but 
no information is provided for predicting the effects of A with any other 
combination of B, (7, and B. With a factorial approach, on the other 
hand, the effects of A are examined for every combination of B, C, and 
B that is included in the experiment. Thus a great deal of information is 
accumulated both about the effects of the factors and about their inter- 
relationships. 

In this connection, Fisher (5.1) has pointed out that it is sometimes 
advisable to introduce into an experiment an extra factor tha,t is not it- 
self of interest, in order that the experiment may form the basis for 
sounder recommendations abotit the other factors, In agricultural ex- 
perimentation in Britain, farmyard manure has served as a subsidiary 
factor of this kind. Any recommendations made to farmers about other 
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factors will be put to the test in some fields where the farmer has ap- 
plied manure and in others where he has not, so that it is well to in- 
vestigate the other factors in both the presence and the absence of ma- 
nure. 

5.14 Summary Comments. To summarize, the following are some in- 
stances where factorial experimentation may be suitable. 

1. In exploratory work where the object is to determine quickly the 
effects of each of a luimber of factors over a specified range. 

2. In investigations of the interactions among the effects of several 
factors. From their nature, interactions cannot be studied without test- 
ing some of the combinations formed from the different factors. Fre- 
(luently, information is most quickly obtained by testing all combina- 
tions. ' 

3. In experiments designed to lead to recommendations that must 
apply over a wide range of conditions. Subsidiary factors may be 
brought into an experiment so as to test the principal factors under a 
variety of conditions similar to those that will be encountered in the 
population to which recommendations are to apply. 

On the other hand, if considerable information has accumulated, or 
if the object of the investigation is specialized, it may be more profitable 
to conduct intensive work on a single factor or on a few combinations of 
factors. For instance, some investigations are directed towards finding 
the combination of the levels of the factors that will produce a maximum 
response. An interesting discussion of procedures for this purpose has 
been given by Friedman and Savage (5.2). They consider the case, 
common in industrial experimentation, where the effect of any specified 
treatment combination can be determined quickly. Thus the treatment 
combinations to be tested can be decided as the experiment proceeds, in 
the light of knowledge gained from combinations that have already been 
tested. They propose first a single-factor experiment to determine the 
optimum level ai of A for fixed levels 6, c, and d, say, of the other factors. 
This is followed by a single-factor experiment to determine the optiimmi 
level 6i of B for fixed values ai, c, and d. Similarly, C is tested at Ui, 6i, 
dj and D at ai, 6i, ci. After the completion of the first 'h'ouiid/' the 
whole process is repeated until the maximum appears to have been 
reached. The authors show that the maximum will usually be reached 
more quickly than with a complete factorial, since the plan is designed 
to concentrate on combinations that are near the maximum. The plan 
would not be feasible for agricultural field experiments, where in general 
an experiment can be changed only once a year. 

Experimenters sometimes find the results df factorial experiments 
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difficult to interpret, because tliejr appear to present a bewildering ya- 
riet 3 ^ of treatment comparisons. It is true that the competent summary 
of a large factorial experiment demands an orderly procedure and often 
takes considerable time. If the factors are for the most part independent, 
the method of analysis by means of main effects and interactions (to be 
illustrated later) will reduce the data to manageable proportions. If 
the numerous factors interact in a pu 2 :ding manner, prolonged study of 
the results and further experimentation may be needed before the facts 
are mastered. The trouble in this case is that the phenomena are com- 
plex, not that the experimentation is faulty. 


6.2 Calculation of Main Effects and Interactions 

6.21 Notation for the 2^*^ Series. The object of section 5.2 is to ex- 
plain how main effects and interactions are calculated, and how they are 
represented in the analysis of variance of the results. We begin with the 
2^ series, where each factor occurs at only two levels. 

For this system the notation used is similar to that of Yates (5.3). 
Letters /I, B, (7, • • • denote the factors. The letters a, 6, c, ••• denote 
one of the two levels at wffiich the corresponding factor occurs; for pur- 
poses of clarity this level will be called the second level The first level 
is signified by absence of the corresponding letter. Thus the treatment 
combmation hd, in a 2^ factorial experiment, means the treatment com- 
bination which contains the first levels of factors x4. and C, and the second 
levels of factors B and D. The treatment combination which consists of 
the first level of all factors is denoted by the symbol (1). 

The symbol (ab) will denote the memi of all observations which receive 
the treatment combination ab. The letters A, B, and AB, when they 
refer to numbers, will represent, respectively, the main effects of A and 
B and the A by B interaction. 

5.22 The 2^ Factorial Experiment. As already shown in section 5.11, 
we have 

A = -|-[(a6) - {h) + (a) - (1)] 

B = ■|-[(a6) + (b) - (a) - (1)1 

Yates (5.3) introduces the same multiplier -g- for the interaction, which 
he defines as 

AB = i[(ab) - (b) - (a) + (1)] 

These quantities and the general mean M are shown in terms of the 
means for the treatment combinations in table 5.1. 
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TABLE 5.1 Main EFFECTS and interactions expressed in terms of 

INDIVIDUAL TREATMENT means: 2" FACTORIAL 


Factorial 

effect. 

Treatment combination 

Divisor 

(1) (ct) (6) (a>>) 

M 

\ 4- 4" 4- 4" 

4 

A 

1 . - . 4- 

2 

B 

— — 4- 4” 

' 2 

AB 

+ — — 4* 

2 


The rows of the table express the factorial effects in terms of the 
original means. If the equations represented hy the table are solved for 
(a5), {b)y etc., in terms of Jf, Ay etc., it will be found that the columns 
of the table enable us to express the original means in terms of the 
factorial effects. For example, from the column for (a&), 

(ab) - M + i[A +B + AB] 

The only point to remember is that the factor ^ occurs with all terms 
except M. From these results, simple effects may be calculated from 
factorial effects. Thus 

(a) — (1) — simple effect of A. when B is at the first level = ^4 — AB 

(ah) — (6) — simple effect of A when B is at the second level = .*4 + ^ B 

These relations are useful when an experiment has been siunmarized in 
terms of the factorial effects, and it is later desired to estimate some of 
the simple effects. From the above example it may be noted that the 
quantity AB measures the error that is committed in estimating the 
simple effects of .4 if the two factors are erroneously assumed to be in- 
dependent. 

The functions Aj B, and AB satisfy the conditions for an orthogonal 
set of functions (section 3.42). Consequently the squares of the fac- 
torial effects, when suitably multiplied, divide the treatments s.s. into 
three single components, each with 1 d.f . In practice these components 
will usually be calculated from the treatment totals [ah]y [6], etc., rather 
than from means. If we define factorial effect totals as illustrated below, 

[1] =' [ab] ^ [6] + H ^ [1] 
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then the sum of squares for .4 in the analysis of variance is [4.]^/4r, where 
r is the number of replicates. Corresponding formulae hold for the other 
factorial effects. 

5 6.23 The 2^ Factorial Experiment. In this case there are treat- 

ment combinations: (1), a, h, c, ab, ac, be, and abc. The simple effect of 
I -4 is determined for each of /o?/r combinations of the other factors: (1), 

bj c, and be. As before, the main effect of A is defined to be the average 
of these four simple effects. 

A = ^[{ahe) - (be) + (ah) - (b) + (ac) - (c) + (a) - (1)] 

Similar definitions hold for B and C. 

The interaction of A with B is now measured separatelji^ at each of the 
two levels of C. 

AB (C at second level) = ^[(abc) — (be) — (ac) + (c)] 

AB (C (it ikBi hrel) ^ 

As would be expected, the quantity AH is taken as the average of these 
two effects. Thus 

AH -i[(ate) -- - (b) - (a) + (1)1^^ \ 

There are two other first-order interactions, AC and BC, which are de- 
fined similarly. 

In addition, we encoimter a new interaction. Separate estimates were 
given above for AH at each of the two levels of C, The difference be- 
tween these two estimates measures the effect of C on the AH inter- 
action. This difference, ^mth. the conventional factor I*, is 

|•[(a&G) - (be) - (ac) + (e) - (ab) + (h) + (a) - (1)] ^ 

and may be called the interaction of AH with C. If the algebra is carried 
out, it wall be found that the same expression measures the interaction 
of AC with H, and that of BC with A, Hence the quantity is called the 
ABC interaction. It is a three-factor^ or second-order^ interaction. ^ 
Three-factor interactions are more difficult to understand than twpr 
factor interactions. Fortunately, in practice three-factor interactions 
are often small relative to main effects and two-factor interactions; 
and quite frequently they can be neglected for the purposes to which the 
results are to be put. Occasionally, cases arise where they are important. 
It might happen, for instance, that factor A does not exert any influence 
unless factors H and C are present in the combination (bo). In this event 
the interaction ABC is as large as the main effect of A or the interactions 
AH and AC. The same effect may occur in less extreme eases, where the 
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combination be is particularly favorable to the response to factor A,' If 
three-factor interactions are found to be substantialj a careful scrutiny 
of the simple effects is usually helpful in interpretation. 

The expressions for the factorial effects in terms of the treatment 
means are summarized in table 5.2. The rows of the table give the 

TABLE 5.2 Main effects and interactions expressed in terms of 

INDIVIDUAL TEBATMSNT MEANS: 2® FACTORIAL 


Factorial 

effect 

j 

Treatment combination 

Divisor 

(1) 

(a) 


(ab) 

(C) 

(ac) 

(be) 

(abc) 

M : 

4- 

+ 

+ 

+ 

+ 

4- 

+ 

4" 

8 

A 


4- 


-f 

- 

-j- 

— 

■f 

4 

B 


, - 


-f 

— 

— 

4- 

4- 

i' ■ ■ 4 . ' 

C 


- 

- 


4" 

4- 

+ 

4" 

4 

AB 

+ 



+ 

.4- 

— 

- 

+ 

4 

AC 

+ 


-f 

- 

— 

+ 

— 

4“ 

4 

BC 


+ 

— 

— 

■— ■ 

— 


4* 

4 

ABC 

— 

+ 



4- 



4* 

4 

1 


factorial effects in terms of the treatment means, while the cohimns give 
the treatment means in terms of the factorial effects. For example, 

(a.) - M + - B - C - AB - AG + BC + ABC] 

(a) - (1) = A - AB -AC + ABC 

(abc) -(!)= A+i? + C' + ABC 

As before, the factor ^ appears with all terms except M in the expression 
for a treatment mean. The difference between two treatment means, of 
two examples are given above, is easily found by noting the signs 
in the two columns in question. 

As the reader may verify, the 7 factorial effects are mutually orthog- 
onal, and each is orthogonal to M. If r is the number of replicates, each 

' 8cr® ' 

factorial effect has the same variance, , or — . The contribution of 

i^r 2r 

any effect to the sum of squares for treatments is [ ]^/8r, where [ ] 
denotes the factorial effect tofafj e.g., 

fi] d~ [«] — [6] [oh]:^ — [60] :+ [ohe] 
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5.24 The 2^'' Factorial Experiment. A few formulae v/ill be given. 
Avhich apply to any number of factors, all at two levels. With n fac- 
tors, there are n main effects, n(n — l)/2 two-factor interactions, 
n(n — l)(n — 2)/6 three-factor interactions, and so on. The successive 
numbers are the coefficients in the expansion of (1 + 1)", omitting the 
first coefficient, unity. 

Tavo equivalent methods for Avriting out any factorial effect in terms 
of the original treatment means are aA^ailable. Each is simple to re- 
member, though the actual AATiting may take some time in a large ex- 
periment. They Aviil be illustrated by finding the BCDE interaction in 
a 2^' experiment, AAuth factors A, j5, C, D, E, and F, 

Rule E Evens versus odds. In every factorial effect, half the treatment 
combinations receive a + sign and half a — sign. Those Avhich receive 
a + sign are those AAfiiich contain an even mxxnbex of the letters that ap- 
pear in the factorial effect. In the BCDE interaction 4 letters, b, c, d, 
and e, appear. There are 8 combinations that contain an even number of 
letters: (1), he, hd, he, cd, ce, de, Rxidhdce, Each of these can be combined 
Avith any one of 4 combinations of the remaining letters a and /, nameh^, 
(1) , a, /, and af. Hence, the 32 terms that appear with a + sign are as 
follows. 


0 ) 

a 

/ 

af 

he 

abc 

hej 

ahef 

hd 

ahd 

hdf 

abdf 

he 

abe 

bef 

ahef 

cd 

tml 

cdf 

aedf 

ce 

ace 

cef 

acef 

de 

ade 

def 

adef 

bede 

ahede 

bedef 

ahedef 


The 32 combinations Avith a — sign are those AAhich have an odd num- 
ber of the letters 5, c, d, or e. In detail, these are 


h 

ab 

¥ 

ahf 

c 

ac 

«/ 

acf 

d 

ad 

df 

adf 

e 

ae 

¥ 

aef 

bed 

abed 

hedf 

ahedf 

bee 

abce 

beef 

abcef 

bde 

abde 

hdef 

abdef 

ede 

aede 

cdff 

aedef 


When calculated from the treatment means, this difference betAveen the 
sums for the tAvo groups of 32 combinations is divided by 2^"“^ to give the 
BCDEeSect, 
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Ruk 2. Algebraic. la this rule the SCDJ? interaction is expressed 
formally as 

BCDE = (a + 1)(& - l)(c - 1)(<^ - 1)(« ~ !)(/ + 

Note that - signs appear with the factors that enter into the interaction 
and + signs with those that do not. If this expression is expanded al- 
gebraically, it gives the interaction as a linear function of the treatment 
means. 

As in the. 2^ and 2® cases, all factorial effects are orthogonal to one an- 
other and to the mean. The contribution of any effect to the sum of 
squares in the analy,sis of variance is [ ■where [ ] denotes the ef- 

fect total. 

6.26 Factors at More than Two Levels: a 4 X 2 Factorial. Various 
notations are used. For example, the 3 levels of a factor A may be de- 
noted by oo, ai, a^, or by a.i, a^, as, or simply by the niunbers 0, 1, 2. For 
illustration, we give below the treatment totals in a 4 X 2 experiment 
on sugar cane. The treatments W'ere 4 levels of dressing of potash, ko, 
ki, fcsi and ks, in arithmetic progression, and 2 levels of phosphate, po 
and pi- There were 5 replicates in randomized blocks. 


TABLE 5.3 Total -nELDS of sugab (tons per acre) 



^0 

h 

/C2 

^3 

Total 

PQ 

180 

248 

277 

285 

990 

Pi 

251 

307 

342 

346 

1246 

Totals 

431 

655 

619 

631 

2236 


The main effect of phosphate (P), which occurs at only 2 levels, is of 
course derived from the comparison of the 2 marginal totals, 990 and 
1246. For the main effect of potash (K) there are 4 marginal totals 
which may be compai’ed. . It will be recalled (section 3.42) that 3 in- 
dependent comparisons may be made amongst 4 totals, and that an in- 
finite number of such sets of 3 may be chosen. Thus the main effect of 
K comprises 3 independent comparisons. 

With the 2” system, we were able to define every main effect as a spe- 
cific linear combination of the treatment means. In the present case we 
could select a particular set of 3 independent comparisons, each of which 
would be a specific linear combination of the treatment means. These 
could be defined as the “components" of the main effect of K. However, 
the particular set that is most useful for the interpretation of the results 
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will change from experiment to experiment, so that a foi'mal definition 
of this type would be of limited utility. The experimenter should use 
whichever set appears most relevant. . 

Now consider the interaction between P and K, From the differences 
between the two rows in table 5.3, the effect of P is estimated separately 
at each level of if, the estimates being 

ko hi h<2, hz 

(Vi -Po) 71 59 65 61 

Any comparison among these 4 figures is a measure of the effect of if on 
the response to P, and therefore is a component of the interaction be- 
tween P and If. Consequently the interaction between P and If con- 
sists of 3 independent comparisons, and is said to have 3 d.f. As with 
the main effect of /f, the particular set of 3 components that will be of 
interest varies with the type of experiment. 

We may also wish to consider the interaction as the effect of P on the 
response to Jf . As we have seen, the response to if has 3 components. 
Since there are 4 increasing levels of potash dressing, we might choose as 
components the linear, quadratic, and cubic components of the response 
curve. Apart from a divisoiv the linear component is 

-3(180) 1(248) + 1(277) + 3(285) - 344 

at the lower level of P and 

-3(251) - 1(307) + 1(342) + 3(346) - 320 

at the higher level of P. The difference between these 2 figures, —24, 
estimates the effect of P on the linear response to if, and is a part of the 
interaction between if and P. The other two components are the ef- 
fects of P on the quadratic and cubic responses to Jf. 

In a 2^^ system, the interaction between P and if is identical, with the 
interaction between if and P. In the more general case the correspond- 
ing result is that any component of the interaction between if and P can 
be derived from the components of the interaction between P and if. 
For example, the effect of P on the linear response to if can be vTitten 

-24 - -3(71) - 1(59) + 1(65) + 3(61) 

so that it is a linear function of the responses to P at the 4 levels of Jf. 
In this more general sense, the two interactions are still equivalent. 

We will consider the analysis of variance of the 8 treatment totals in 
some detail. With 5 replicates, the total s.s., on a single-plot basis, is 
(180)2 + (251)2 + (248)2 ,+ ...+ (346)2 (2236)2 



The 7 di. subdivide into 1 for the main effect of P, 3 for that of K, and 
3 for the PK interaction. In practice the first two terms are calculated 
directly and the interaction obtained by subtraction. The computations 
are 

(1246 - 990f 


" + (555)2 ^ (619)2 + (631)2 (2236)2 


= 1638.4 


2518.4 


10 40 

TABLE 5.4 PKELIMINARy ANAIjYSIS of vaeiance of eata in table 5.3 



d.f. 

s.s. 

m.s. 

F 

1 

.1638.4 

1638.4 

K 

3 

2518.4 

839.5 

PK 

3 

8.4 

2.8 

Total 

7 

4165.2 



As is common in agricultural experiments, the interaction m.s. is small 
compared mth those for the main effects (in fact it is below the mean 
square for error) . 

As an exercise we will divide the K main effect and its interaction wffh 
P into linear, quadratic, and cubic components. The first step is to cal- 
culate the totals for these effects separatel3!^ at each level of P. The re- 
sults are shown below, the multipliers for the 4 levels of K being shown in 
parentheses. These multipliers are obtained from a table of orthogonal 
polynomials (5.9). 


Ki 

R, 

Xc 

(-3, -1, +1, 4-3) 

(+1, -1, -1, +1) 

(-1, +3. - 

4344 

- 60 

418 

4320 

- 52 

-10 

4664 

-112 

4 8 

-- 24 

4 8 

-28 


+ ?>o) 

(pi - po) 

From the sum line, we obtain the 3 components of the K main effect, 
= ^( 664)2 = 2204 . 5 , K, = ,^( 112 )" = 313.6 

the df(re?-e?ice line, the components of the P7C interaction 
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PKi = ^(24)2 = 2.9, PKa = ^(8)2 = 1.6 
P Ac — ■2’^^(28)2 = 3.9 
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The divisors are found from the usual rule for lii>ear functions (section 
3.42). The three K components add to 2518.4, the total s.s. for Kj while 
the PK components add to 8.4, the sum of squares fowPK. The detailed 
analysis of variance is shown in table 6.5. 

TABLE 5.5 More detailed analysis of variance of the data 
IN table 5.3 



d.f. 

s.s. or m.s. 

p 

' ■ 1 ■ 

1638.4 

Ki 

1 

2204.5 

IQ 

1 

313.6 

Ke 

1 

0.3 

PKi 

1 

2,9 

PK^ 

1 

1.6 

PKc 

1 

3.9 


Since every component has 1 d.f., the mean squares are the same as 
the sums of squares. The error m.s. is 16.35, with 28 d.f. The quadratic 
component oi K is significant, indicating a falling off in the response at 
the higher levels of application. Neither the cubic component nor any 
of the interactions approaches significance, 

5.26 A 3^ Factorial Experiment. As a second example, the treatment 
totals are shown in table 5.6 for an experiment with 3 levels of nitrogen 
fertilizer and 3 of phosphate fertilizer. The data are the numbers of 
lettuce plants that emerged from the ground and are totals over 12 plots 


TABLE 5.6 Numbers op lettuce plants emerging * 



no 

ni 

n2 

Totals 

Vo 

449 (Aa) 

413 (C/3) 

326 (By) 

1188 

Pi 

409 (BIS) 

358 (Ay) 

291 (Ca) 

1058 

P2 

341 iCy) 

278 (Ba) 

312 (Al3) 

931 

Totals 

1199 

1049 

929 

3177 


, * The use of the latiri and greek letters will be explained later, 

each. Both nitrogen and phosphate appear to have had a deleterious 
j effect on emergence (the subscript 2 denotes the largest application). 

} The main effects of N and P both comprize tw^o independent com- 

parisons, and thus have 2 d.f. each. Since the amounts of N and P were 
in arithmetic progression, it would probably again be appropriate, as in 
the previous example, to choose the linear and quadratic components of 
the regression on amount of dressing as the individual components of the 
main effects. 
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Tiie Ni component can be estimated separately at each level of P. 
This means that there are 2 d.f. which measure the effect of P on the 
linear response to N, These are a part of the A^P interaction. Another 
2 d.f. are supplied by the effect of P on the JV,, component, so that the 
NP intei’action contains 4 d.f. 

From inspection of the margins of table 5.6, it is evident that the main 
effects of both A and P are approximately linear. Consequently, the 
most interesting single degree of freedom from the interactions is likely 
to be the interaction of Ni with Pi. In many agricultural experiments 
this type of interaction is the only one that approaches significance. It 
is worth while calculating this interaction separately in the analysis of 
variance. The other three components, NiPg, NgPi, and NgPg, will also 
be obtained. The initial computations appear in table 5.7. 


TABLE 6.7 Calculation of lineae and quade-atic effects fob the 

ANALYSIS OP VAEIANCB 


po 

Pi 

P2 

Ni 

(-1,0,1) 

•-123 

•-118 

Na 

(1, -2, 1) 

— 51 
-16 
+ 97 

% 

■ + ' ' 

712 

■■ 

Pi 

(-1,0, 1) 
-108 

135 , 

- 14 

Pa 

(1, -2, 1) 
-28 
-25 - 
+56 

Sum 

-~270(A^i) 

+ m(Ng) 

Sum 

-267(P() 

+ 3(P„) 

p.r-- 

+ U(NiPi) 

+148(i7,Pi) 

.Ni ' 

+ 94(PilVd 

+mPgNl) 

Pa 

-4- 84(iV'^P^) 

+ 78iNaPa} 


+ U8(PlNg) 

+78(PgNg) 


The left side of the table shows the Ni and Ng effects for each level of 
P, while the right side shows the Pi and Pg effects for each level of N. 
For instance, from table 5.6, 

NiPo = 326 - 449 - -123; PgUo = 449 - 2(409) + 341 = -28 

The column sums give the individual components of the N and P mam 
effects. Now consider the difference between the third and the first row. 
For the Ni column, this difference (+94) gives the linear effect of P on 
N : i.e., the NiPi interaction. From the Pi column, the difference gives 
the linear effect of N on Pi, or the PiNi interaction, which is exactly the 
same as the NiPi interaction. The other two columns provide the NgPi 
and the NiPg effects. 

Finally, the snm of the first and third rows, minus twice the second 
row, leads to the components of interaction that contain a quadratic 
term. It will be seen that all four components can be obtained from 
either the left or the right half of the table, so that in practice only one 
half is required. By computing both halves we verify the symmetry of 
the components with respect to iV and P, 
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The squares of these quantities, with appropriate divisors, will give an 
analysis of variance of the 8 di. amongst the treatment totals into 8 
single components. Since an individual entry in table 5.6 is the total 
of 12 plots, the divisors may be verified to be as shown below^ 



Ni or Pi 

Nq or Pq 

NiPi 

NiP, 

N^Pi 


Divisor 

72 

216 

48 

144 

144 

432 


For instance, the divisor for may be worked out as follows. The 
three Nq figures in table 5.7 (i.e., —51, —16, +97) each have divisor 
12(1^ + 2^ + 1^), or 72. Since the NqPq total is a linear function of 
these three figures, with coefficients 1, —2, and 1, this total has divisor 
72(P + 2^ + 1^), or 432. The analysis of variance is shown in table 5.8. 

TABLE 5.8 Subdivision of the treativient s.s. 



d.f. 

s.s. or m.s. 

Ni 

T 

1012.60 

Na 

1 

4.17 

Pi 

1 

917.35 

P, 

1 

0.04 

NiPi 

1 

184.08 

NiPc, 

1 

49.00 

h\Pi 

1 

152.11 


1 

14.08 


The error m.s. in this experiment is about 59. The linear effects of both 
fertilizers are significant, with no indication of curvature. The NiPi ef- 
fect is significant at the 10% level, but not at the 5% level. 

In table 5.6, p. 135, latin and greek letters were superimposed so as to 
form a 3 X 3 graeco-latin square. This square leads to another method 
of calculating the sum of squares for the interactions (4 d.f.). Although 
the method is not likety to be of use for purposes of interpretation, it has 
formed the basis of some ingenious devices in the construction of de- 
signs. In the square, the column totals represent the main effects of N 
and the row totals those of P, Since the latin letter totals are orthogonal 
to rows and columns, it seems reasonable to suppose that they must 
represent two of the 4 components of the NP interaction. Similarly, 
the greek letter totals represent the remaining 2 components. These 


totals are shown below. 





A B C 

Total 

a 


y, ; Total 

1119 1013 1045 

3177 

1018 

1134 

1025 3177 
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Each figure is now a total of 36 plots. The sum of squares of deviations 
of the latin letter totals, divided by 36, is 164.22, and the corresponding 
figure for the greek letters is 235.06. These add to 399.28, which is the 
same as the total s.s. for the interactions in table 5.8, apart from round- 
ing differences. 

6.27 General Method of Analysis. Suppose that there are three fac- 
tors, A, Cj which occur at a, jd, and y levels respectively. The main 
effects have (a - 1), - 1 ), and (t - 1 ) components or degrees of 

freedom respectively. Each component of the main effect of A can be esti- 
mated separately at each of the /3 levels of B. Thus each component of A 
contributes (p - 1) degrees of freedom to the AB interaction. This means 
that the AB interaction contains a total of (a — 1)(/? — 1) components 
or degrees of freedom. Similarly the AC interaction has {a — 1)(7 — 1 ) 
degrees of freedom, and the BC interaction has - 1)(7 — 1) degrees 
of freedom. 

To compute the sums of ^squares for the main effects and first-order 
interactions, wc form two-way tables for each pair of factors. Consider 
the A by B two-way table. The total s.s. among cells has (afi — 1) de- 
grees of freedom. From the marginal totals in the table we compute 
the sum of squares for the main effect of A, with (a ~ 1 ) degrees of free- 
dom, and that for the main effect of S, idth (j3 — 1) degrees of free- 
dom, By subtraction, the sum of squares for the AB interaction, with 
(a — 1 ) (jd — 1 ) degrees of freedom, is obtained. 

There remains the three-factor, or ABC, interaction. Now each of 
the (a — l)(/3 1 ) components in the AB interaction is estimated sep- 

arately at each level of C. It will therefore contribute (7 — 1) degrees 
of freedom to the ABC interaction, so that the latter contains in all 
(a — .l)(/d — 1)(7 — 1) degrees of freedom. The sum of squares for ABC/ 
is also obtained most easily by subtraction. From the total s.s. amongst 
treatments, with {a^y — 1 ) degrees of freedom, subtract the sums of 
squares for A, B, C, AB, AC, and BC. The remainder will be the sum of 
squares for ABC. 

It is hoped that the reader will find no difficulty in extending these 
methods to the case where there are more than three factors. In gen- 
eral, the sums of squares for main effects are calculated directly, and those 
for interactions are calculated by subtraction. To compute an ABCD 
interaction, for instance, we require a four-way table for the four factors 
represented. From the sum of squares for this table we subtract the sum 
of squares for all main effects and two- and three-factor interactions 
among the factors in question. Calculations "must be checked by re- 
computation. Of course, if the experimenter subdivides any interaction 
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into single components as in the previous section, a check is provided by 
this process. 

In the general case it remains true that any component of a factorial 
effect is orthogonal to any component of any other factorial effect. Thus 
I ■ any component of the BC interaction is orthogonal to any component of 

the A main effect, or of the BCD interaction, etc. Two components of 
I the same factorial effect may or may not be independent. For instance, 

if A. occurs at three levels, the comparison (02 — %) is independent of the 
compaiison (02 — 2ai + gq), but is not independent of ~ ao). 

The following selected references contain a discussion of factorial ex- 
periments 'with some worked examples. 

5.3 Yates, F. The design and analysis of factorial experiments. Imp. Bur. Soil 
ScL Tech. Comm. 35, 1937. This gives the most comprehensive account that 
I is available, with numerous worked examples. 

- 5.4 Yates, F. Complex experiments. Jour. Roy. Stut Soc. Suppl. 2, 181“247, 

1935. An earlier reference, with examples of 2^, 2^, 4 X 3, and 3® factorials. 
5.5 Tippett, L. H. C. The methods of staUstics. Williams and Norgate, London, 
2nd ed., 1937. The use of factorial designs in industrial experimentation is in- 
dicated. A worked example is given of the breakdown of the treatment s.s. 
in a 5 X 4 X 3 X 2 factorial in single replication. 

■ 5.6 Lindquist, E. F. Statistical analysis in educational research. Houghton 
Mifflin, Boston, 1940. The application of factorial designs in experiments on 
methods of teaching is described, with a worked analysis of variance for a 
4 X 3 design completely randomized. 

5.7 Bliss, C. I. Factorial design and covariance in the biological assay of vitamin 
■ D. Jour. Amer. Stat. Assoc. 35, 498-506, 1940. Worked example of a 3 X 2 

■ ■ ' design.' 

5.28 Interpretation of the Analysis: First Example. The separation 
of the treatment comparisons into main effects and interactions is a con- 
venient and po^verful method of analysis in cases where interactions axe 
small relative to main effects. When interactions are large, this analysis 
must be supplemented by a detailed examination of the nature of the 
interactions. It may, in fact, be found that an analysis into main effects 
^ and interactions is not suited to the data at hand. There is sometimes a 

tendency to apply the factorial method of analysis mechanically without 
considering whether it is suitable or not, and also a tendency to rely too 
much on the initial analysis of variance alone when writing a summary of 
the results. Below two examples are presented where the initial analysis 
of variance is not very informative, and where the results can be sum- 
marized better in terms of simple rather than factorial effects. 

The first experiment was conducted by the Wailuku Sugar Company. 
i . . .. Three varieties of sugar cane were compared, in combination mth three 

? ” levels of nitrogen (150, 210, and 270 lb. N per acre respectively). The 
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crop was the second harvesting, or the first ratoon crop. In table 6.9 
only the relevant part of the analysis of variance is shown: i.e., the suni 
of squares for the main effects of V and N and for the VN interaction, 
plus the error s.St The data are in tons of cane per aci'e. The conclusions 

TABLE 5.9 Analtsis of vaeiancb of a 3 X 3 sugar cane experiment 



d.f. 

S.S.. 

m.s. 

F 

.'■2 

319.38 

159.69* 

N 

2 

56.54 

28.27 

VN 

4 . 

550.79 

139.95 * 

Error 

24 

1053.84 

43.91 


from the analysis of variance are that the main effects of V and the VN 
interaction are both significant, but there is no sign of a main effect due 
to N. This statement tells little about the results of the experiment. 

The treatment totals (over 4 replications) are shoivn b elow w ith their 
standard eiTors. Since the standard error per plot is 43.91 = 6.626, 
the standard error for a treatment total is 13.3 as shown. 

TABLE 6.10 Treatment totals (tons) in a 3 X 3 sugar cane experiment 

(±13.3) 



no 

ni 

?12 

Total 

S.e. 

Vi 

260.1 

275.9 

303.8 

845.8 


V2 

245.8 

250.2 

281.7 

777.7 

d=23.0 

n 

274.4 

258.1 

231.6 

764.1 


Total 

s.e. 

786.3 

784.2 

d=23.0 

817.1 

2387.6 



Instead of having no effect, nitrogen has apparently given a steady in- 
crease in yields with the first two varieties, but a steady decrease with the 
third variety. Further, the significant main effects of varieties apply 
only to the average varietal yields over all 3 dressings of N, and not to 
yields with a particular rate of dressing. On the average Vi gives a sub- 
stantially higher yield than V 3 , but at the lowest level of N, V 3 is slightly 
above vi. 

The subsequent analysis may be made either by means of <-tests ap- 
plied to table 5.10 or by means of a further subdivision of the analysis of 
variance. It is of interest to examine the response to N for each variety 
separately. To test the linear responses by means of a Uest, we require 
the standard error for (% - no) as computed for each variety. This 
standard error is 13.3 X 1.414, or 18.8, and since the 5% lvalue for 24 
d.f. is 2.064, the quantities (ns - no) must attain the value 18,8 X 2.064, 
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or 38.8, in order to be significant at the 5% level The actual values are 
(n2 no) vi = +37.7; ^0)^2 +35.9; 

(n2 — no)P3 ^ —42.8 

Thus neither of the increa,ses with vi and % quite reaches the significance 
level, though both are close to it. The decrease with ^3 is significant. 

The test of linearity of the response curves is made by means of the 
quantity (^2 + Uq — 2ni), ealculatecl for each variety. The standard 
error of this quantity is 13.3 X or 32.6. The reader may verify 
from table 5.10 that none of the curvature terms even exceeds its stand- 
ard error, so that there is no occasion to reject the hypothesis that the 
responses are linear. 

It is not quite so clear w+at tests are appropriate for appraising the 
varietal differences. Since, however, the difference between and ^2 is 
very consistent at all levels of AT, a t-iest of the total difference is sug- 
gested. This difference is 68.1, and since the value required for signifi- 
cance is 23.0 X 1.414 X 2.064, or 67.1, the superiority of over is 
Just significant. Further, it is evident on inspection that % does not dif- 
fer significantly from the other varieties at either of the two lower levels 
of At the highest level, V3 is significantly below both i/j. and v^. 

When interactions are large, much care is required in the preparation 
of a statement that summarizes the results, and it is not easy to reach a 
form that is free from criticism. The following is a suggestion for this 
example. 

“The increase in yield of cane to the highest dressing of N (270 lb. per 
acre) over the lovrest dressing (150 lb, per acre) was 9.4 tons per acre with 

and 9.0 tons per acre with V 2 - Both increases Just failed to be signifi- 
cant at the 5% level. With ^3, on the other hand, the highest dressing 
of N decreased the yield significantly by 10.7 tons per acre as compared 
with the lowest dressing. For all three varieties the effects of N appeared 
to be proportional to the amount applied, within the range investigated 
in this experiment. 

“Variety 1 gave a higher yield than variety 2 for all levels of AT, the 
average difference, 5.7 tons, being just significant at the 5% level The 
yields for variety 3 did not differ significantly from those of the other 
varieties at the two lower levels of N. At the highest level of Nf the yield 
for variety 3 was significantly lower than that for either of the other va- 
rieties/^ 

5.29 Interpretation of the Analysis : Second Example. This example 
is more complex, mainly because the factorial (a 3 X 3) is of an unusual 
type. The data come from a long-term experiment on meadow hay, 



conducted at Lady Manner’s School, Bake^rell, England, with the co- 
operation of Rothamsted Experimental Station. The yields are for the 
1937 season. The two factors are shoum schematically below. 


First factor (3 levels) 
f No manure 
j Mixed artificials 
[ 8 tons compost 


Applied in 
1936 


Second factor (3 levels) 
f No manui'e 


I Mixed artificials 
1 8 tons compost J 


Applied in 
1937 


The nature of the experiment may become clear from a discussion of some 
of the individual treatment combinations. In any replicate there are 
nine plots, of which three received mixed artificial fertilizers in 193/ . 
In the previous year, 1936, one of these three plots received no manure, 
one received mixed artificials, and one received compost. Thus these 
three plots enable us to compare the remkial effects of the 1936 appli- 
cations of artificials and compost, on plots which received artificials in 
the current year, 1937. Similarly, from other treatment combinations, 
we can compare the residual effects of artificials and compost on plots 
which were umnanured in the current year and on plots which received 
compost in the current year. 

Further, in any replicate there are three plots whicjh were unmanured 
1936. Of these, one had no manure, one had artificials, and one had 
compost in 1937. Consequently we may also assess the direct effects of 
artificials and compost applied in 1937 on plots which were unnianured 
in the previous year, and likewise on plots which received artificials or 
compost in the previous year. Both the artificials and compost contain 
the three common plant nutrients, nitrogen, phosphorus, and potash. 

The system of treatments is an ingenious one, designed to measure 
direct and residual effects at the same time. The experiment was started 
in 1932, but for simplicity we mil ignore any effects of treatments applied 
to 1936. 

The treatment means and the analysis of variance are shown in tables 
1 and 5.12. The analysis of variance was calculated from the treal*^ 
ment means, rather than from single plots, and the error m.s. has been 
adjusted so as to apply to a treatment mean. The direct effects are 
highly significant, the interactions are significant at the 5% level, but 
residual effects are not significant. In considering the nature of the ef- 
fects leading to these results, it is again convenient to think in terms of 
simple effects rather than of main effects and interactions. It is worth 
n oting th at the standard error of a single entry in the two-way table is 

V 6 666, or 2.58, while that for the difference between two entries is 
3.66. 
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With regard to residual effects, table 5dl shows that coinpost pro- 
duced a large and highly significant increase, 13.4 cwt. =b 3.65, on the 
plots that received no manure in the current year. On plots that re- 
ceived manure in 1937, either artificials or compost, there is no suggestion 
of any residual effect of compost. None of the residual effects of arti- 
ficials approaches the level required for significance. These results are 

TABLE 5.11 Teeatment mean yields (hay, cwt. pee acee) (±2.58) 


Residual effects of 1936 treatments 
Direct effects of r- — — , 


1V}37 tre.atments 

None 

Artificials 

Compost 

Mean 

None 

53.6 

56,8 

67.0 

59,1 

Artificials 

80.8 

82.3 

80.5 

81.2 

Compost 

74.3 

69.1 

70.0 

71.1 

Mean 

69.6 

69.4 

72.5 

70.6 


TABLE 5.12 Analysis of vakiance of tbeatment means 



d.f. 

s.s. 

m.s. 

Direct effects 

:2 , 

732.28 

366.14 ** 

liesidual effects 

■ '.2 '■ 

18.24 

9.12 

Interaction 

4 

97.01 

24.25 

Error 

'■ 24 - - 

159.74 

6.656 


in accord with general fertilizer experience, since a compoKSt is more 
likely to give residual effects than an inorganic fertilizer, and since re- 
sidual effects would be expected to show up most clearly on plots which 
have no manures during the current year. 

So far as direct effects are concerned, artificials were superior to com- 
post whatever the residual treatment. 

1936 manuring 1937 artificials — 1937 compost (cwt.) 

No manure 4-6.5 

Artificials +13.2 

.'Compost . ■ ■;+10.5:..' . 

The differences among these three figures are within the limits of ex- 
perimental error. To test this point, we find the sum of squares of 
deviations of these differences, which comes to 22.73. This is divided by 
2 to make it comparable with the analysis of variance, giving 11,36, 
which represents two of the 4 d.f. for interactions. The mean square, 
5.68, is slightly below the error m.s. Thus the superiority of artificials 
appears to be consistent. 
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The itiereases for artificials and compost over no manure are much re- 
duced on plots that received compost in the previous j^ear. 


1936 manuring 


Compost 

Artificials and none 
(averaged) 


(1937 artificials - 1937 none) (1937 compost - 1937 none) 


+13.5 

+26.4 


+3.0 

+16.5 


The striking reduction on plots with 1936 compost is simply a reflection 
of the na.ture of the residual effect of compost as previously examined. 

The conclusions from the experiment might be summarized as follows. 

''Artificials applied in 1937 increased yields by 13.5 cwt. on plots that 
received compost in the previous year, and by 26.4 cwt. on plots that did 
not receive compost in the previous year. The corresponding increases 
due to 1937 compost were 3.0 cwt. and 16.5 cwt., respectively. The su- 
periority of 1937 artificials over 1937 compost, which averaged 10.1 
cwt., appeared to be independent of the type of manuring during the 
previous year. 

“As regards residual effects, compost applied in 1936 increased yields 
by 13.4 cwt, on plots that were immanured in 1937, but gave no apparent 
increase on plots manured in 1937. There were no significant residual 
effects of 1936 artificials.^^ 

It seems evident that the significant interaction m.s. in the analysis 
variance must arise mainly from the fact that compost had a residual 
effect only when no manure was applied currently. As an exercise it may 
be instructive to isolate the part of the interaction s.s. that is due to this 
effect. Since artificials appeared to have no residual effect, we will com- 
bine the 1936 artificials with the 1936 unmanured plots. Consequently, 
the residual effects of compost are obtained from the comparison 

2(1936 compost) — (1936 artificials) — (1936 none) 

quantity may be calculated separately for each of the 1937 treat- 
ments. Its values are 


(1937 none) - +-23.6; (1937 artificials) -2,1; 


(1937 compost) == —3.4 

The contrast bet^ween the residual effect of compost on plots without 
1937 manures and that on plots with 1937 manures may be estimated 
from the comparison 

2(+23.6) - (-2.1) - (-3.4)-= +6^^ 
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The square of this quantity, with a suitable divisor, is a single com- 
ponent of the interaction s.s. Since the divisor for the quantities 23.6, 
etc., is 6, the divisor needed is 6 X 6, or 36. Hence the sum of squares 
for this component is (52.7)^/36, or 77.15. The remaining 3 di. for in- 
teractions .have a sum of squares equal to 19.86. The mean square, 
6.62, is no larger than the error m.s. This verifies the suggestion that 
the significance of the interaction m.s. can be attributed to the type of 
residual effect of compost. 

Some readers may have difficulty in satisfying themselves that the 
component isolated above really is a part of the interaction s.s. It may 
be helpful to express the basic quantity, 52.7, as a linear function of the 
original mean yields in table 5.11. The multipliers of the means will be 
found to be as shown below. 


1937 treatments 

None 

1936 treatments 

Artificials 

Compost 

None 

-2 

-2 , 

4 

Artificials 

1 

1 

~2 

Compost 

1 

1 , 

--2 


The sums of the coefficients are zero over every row and column so that 
the expression is orthogonal to both sets of main effects. Hence it must 
be a part of the interaction. The coefficients also enable us to verify the 
divisor, 30, Avhich is equal to the sum of the squares of the coefficients. 

Since the interactions can be attributed to the behavior of a single 
group of plots (those with compost in 1936 and no manuring in 1937), it 
might be suspected that this treatment had been allotted by chance a 
favorable set of plots. The residual increase, 13.4 cwt., does seem rather 
large in relation to the direct effect of compost. An examination of 
previous results does not lend much weight to this suspicion. Over the 
4 preceding years, the average residual response to compost, with no 
ciurent manure, was 11.1 c^vt. In 2 years, 1935 and 1933, the plots in- 
volved in this comparison were the same as those in 1937. For these 
years the average residual effect was 6.2 cwt. In the other 2 years, when 
a different set of compost plots is involved, the average residual effect 
was 16.0 cwt. 

The two preceding examples are intended to illustrate the fact that 
the most informative subdivision of the treatment comparisons depends 
on the type of experiment. An analysis copied from a model that ap- 
pears similar in form may be inappropriate or even meaningless. The 
experimenter should first decide which comparisons are necessary for the 
interpretation of the results. The subsequent analysis, either by Wests 



FACTOEIAL EXPERIMENTS 6.29 

or by subdivision of the analysis of variance, should be directed towards 
these comparisons. 


5.3 Designs for Factorial Experiments 

5.31 Factorials in Complete Block Designs. Moat types of experi- 
mental plan are suitable for factorial experunents. In particular, if the 
total munber of treatment combinations is not large, the designs de- 
scribed in chapter 4 are frequently used. The relative advantages of 
complete randomization, randomized blocks, and latin squares are the 
same with factorial as mth non-factorial sets of treatments. For il- 
lustration, we show arrangements for a 4 X 2 factorial (i) in 8 random- 
zed blocks and (ii) m an 8 X 8 latin square. In the former case, only 
the first replicate is given. 

TABLE 5.13 Factom.il exi’ebimbnt abbanged in (i) eandomizbd blocks 

AND (ii) A LATIN SQUARE 

8 treatment combinations 


8 replications 


i. In randomized blocks 
Rep, I 
12 


In a latin square 


A at 4 levels (1,2, 3, 4) 

B at 2 levels (1,2) 

Analysis of variance 
Replications 7 

Treatment combinations 7 

A 3 

B 1 

AB 3 

Error 49 

Total 63 


Analysis of variance 
Rows 7 

Columns 7 

Treatment combinations 7 

A 3 

B 1 

AB 3 

Error 42 

Total ti3 


In the next section, a worked example of the analysis of a large fac- 
torial experiment in randomized bloclcs is presented. 
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5,32 4 X 4 X 3 FACTORIAL IN RAND0MI2;ED BLOCKS 


TABLE 5.14 Weights of benervated (y) and corresponding normal (x) 
MUSCLE (unit — 0.01 gram) 

Rep, I Number of treatment periods daily 


Length of 

One (ai) 

Three (ag) 

Six (ao) 

t-\f 'J. .jl - ^ 

VX VXXXXJXX\:f ^JL J \jx 




* 

, 

— — — j 

(minutes) 

current 

y 

■■ X 

y 

X 

V 

X 

KW 

Galvanic 

72 

152 

74 

131 

60 

131 


Faradic 

61 

130 

61 

129 

65 

126 


60 cycle 

62 

141 

65 

112 

70 

111 


25 cycle 

85 

147 

76 

125 

61 

130 

2 (ha) 

Galvanic 

67 

136 

52 

110 

62 

122 


Faradic 

60 

111 

55 

180 

59 

122 


60 cj^'cle 

64 

126 

65 

190 

64 

98 


25 cycle 

67 

123 

72 

117 

60 

92 

3(63) 

Galvanic 

57 

120 

66 

132 

72 

129 


Faradic 

72 

165 

43 

95 

43 

97 


60 cycle 

63 

112 

66 

130 

72 

180 


25 cycle 

56 

125 

75 

130 

92 

162 

5(65) 

Galvanic 

57 

121 

66 

160 

78 

135 


Faradic 

60 

87 

63 

115 

58 

118 


60 cy cle 

61 

93 

79 

126 

68 

160 


25 cycle 

73 

108 

86 

140 

71 

120 

Rep. II 

l(6i) 

Galvanic 

46 

97 

74 

131 

58 

SI 


Faradic 

60 

126 

64 

124 

52 

102 


60 cycle 

71 

129 

64 

117 

71 

108 


25 cycle 

53 

108 

65 

108 

66 

lOS 

2 (62) 

Galvanic 

44 

83 

58 

117 

54 

97 


Faradic 

57 

104 

55 

112 

51 

100 


60 cycle 

62 

114 

61 

100 

79 

115 


25 cycle 

60 

105 

78 

112 

82 

102 

3 ih) 

Galvanic 

53 

101 

50 

103 

61 

115 


Faradic 

56 

120 

57 

110 

56 

105 


60 cycle 

56 

101 

56 

109 

71 

105 


25 cycle 

56 

97 

58 

87 

69 

107 

5 (hf,) 

Galvanic 

46 

107 

55 

108 

64 

115 


Faradic 

56 

109 

55 

104 

57 

103 


60 cycle 

64 

114 

66 

101 

62 

99 


25 cycle 

59 

102 

58 

98 

88 

135 



FACTORIAL EXPERIMENTS 5.32 


S.32 Numerical Example: a 4X4X3 Factorial in Sodomized 
Blocks. A number of experiments have indicated that electrical stimu- 
lation may be helpful in preventing the wasting away of muscles that are 
deneiwated. A factorial experiment on rats was conducted by Solandt, 
DcLury, and Himter (5.8) in order to learn something about the best 
type of current and the most effective method of treatment. The fac- 
and their levels are shomi below. 


Nuiiibpr of treatment 
periods daily 
1 , 

3 

6 


B 

Length of treatment 
(minutes) 

1 . 

2 

3 

5 


Type of current 

Galvanic 

Faradic 

60 cycle alternating 
25 cjmlo alternating 


Trea-tments were started on the third day after denervation and con- 
tinued for 11 consecutive days. There are 48 different combinations of 
methods of treatment, each of which was applied to a different rat. Two 
replications were conducted, using 96 rats in all. 

The muscles denervated were the gastrocnemius-soleus group on one 
side of the animal, denervation being accomplished by the removal of a 
small part of the sciatic nerve. The measure used for judging the ef- 
fects of the treatments was the weight of the denervated muscle at the 
end of the experiment. Since this depends on the size of the animal, the 
weight of the corresponding muscle on the other side of the body was in- 
cluded as a covariance variate. The data are shown in table 5.14. 

For a covariance analysis we require analyses of variance for both the 
denerx^ated muscle iy) and the normal muscle {x)^ and the analysis of the 
product iyx). From the discussion in section 5.27 and preceding sec- 
tions, the reader should have little difficulty in completing the analy^sis. 
The replicate totals, treatment-combination totals, and the grand total 
are first computed. From the treatment totals, the three two-way tables 
are constructed, as given in table 5,15. Since each set of main effect 
totals is obtained twice, this part is self-checking (though only one set of 
main effect totals need be recorded). To illustrate the computations 
made on the A X B table, we have for the analysis of y 


iov A X B table 


+ (543)" + . • • + (546)"] - ^[(6069)"] = 1039 

: *[(1936)^ + (2028)2 -{- (2106)2] - ^[(6069)2] = 447 

^[(1565)2 + (1488)2 + (1476)2 + (1640)2] - ^[(6069)2] = 223 
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By subtraction, the AB sum of squares is found to be 369. For the sum 
of products, we replace each by the corresponding yx product. Thus 

Total s.p. = (510) (1030) + (543) (977) + • • * + (546) (985)] 

- 9^(6069) (11,307) - 1151 

Finally, the ABC sum of squares is found by calculating the total s.s. 
among all 48 treatment combinations, and subtracting the siun of squares 
ioT B/Cj ABj ACyi%nd BCr 


TABLE 6.15 Two-way tables in a 4 X 4 X 3 factorial 





y 




X 





ai 

cs 

ae 

Total 


«s 

ae 

^ Total 

B 

h 

510 

543 

612 

1,565 

1,030 

977 

897 

i 2,904 

Length of 

h 

481 

496 

511 

1,488 

902 

1,038 

848 

2,788 

treatment 

h 

469 

471 

536 ; 

1,476 

941 

896 

1,000 

2,837 


h 

476 

518 

■ 546 

1,540 

841 

952 

986 

2,778 

C 

G \ 

442 

485 

• 518 1 

1,446 

917 

992 

925 

2,834 

Type of 

F ' 

482 

453 

441 i 

1,376 

952 

969 

873 

2,794 

current 

60 

503 

522 

557 ! 

1,582 

930 

985 

, . ,976: 

2,801 


„ 25' i 

500 

56S 

-. 589 I 

1,666 

915 

917 

956 i 

2,788 

A totals 


1,936 

2,028 

2,105 

6,0G9 

3,714 

3,863 

3,730 

11,307 





h 

y 

bs 

h 

h 

h 

bz 

h, 

€ 


393 

337 

359 

356 

723 

665 

700 

746 

Type of 

' ^ 'T / 

363 

337 

327 

349 

737 

729 

692 

636 

current 

60 

403 

395 

384 

400 

718 

743 

737 

693 


25 

406 

419 

406 

435 

726 

651 

708 

703 


The anali’^ses of yx, and are given in table 5.16. The regres- 
sion coefficient of weight of denervated on weight of normal muscle is 
3977/16,013, or 0.248361. The next step is to form the analysis of 
(y — bx)^. This i»s done by adding to any i/ value (—25) times the cor- 
responding yx value, plus (6^) times the corresponding value. The 
mean squares for (y — bx) are the quantities used to test the significance 
of treatment effects on the weights of denervated muscles, adjusted for 
their regression on the weights of the normal muscles. As shown in 
section 3.86, the treatment m.s. is slightly inflated because of the sam- 
pling error of the regression coefficient. The inflation is sufficiently 
small, however, that only effects which appear to be on the borderline of 
significance need be tested by the exact method (section 3.86), which is 
more laborious to calculate. ' ^ 



In the analysis of variance (table 5,16) only two factorial effects are 
significant : those for the main effects of C (type of current used) and for 
the main effects of A (number of treatment periods daily). ' The AC in- 
teraction m.s. is somewhat higher than the error m.s.j but does not ap- 
proach the 5% significance level These results suggest that in preparing 

TABLE 5.16 Analyses of variance 


Replications 

A (number of i'.reatmonts) 
B (length of period) 

C (typo of curireiit) 

AB 

C 

BC 

ABC 

Error 


Total 

^ 46 d.f. for (?/ • 
d.f. for 6/ ' 



' »> 

yx 

X? 

(X/ - 


d.f. 

s.s. 

s.p. 

s.s. 

s.s. 

m.s. 

1 

605 

2,503 

10,354 

0 


2 

447 

64 

418 

441 

220.5 * 

3 

223 

137 

415 

181 

60.4 

3 

2,145 

104 

281 

2,111 

703.7 

6 

36)9 

950 

5,202 

218 

36.3 

6 

645 

476 

1,014 

471 

78.5 

0 

299 

11 

2,015 

418 

46.4 

IS 

1,050 

666 

5,198 

1,040 

57.8 

471 

3,191) 

3,977 

16,013 

2,211 

48.1 

95-^ 

8,982 

8,888 

40,910 

7,091 



■ hx)K 


Note. In reference (5,8), the sums of squares for A (7 and ABC are in error. 

summary tables for further examination and for presentation of the 
results, we probabl3' need only the AC two-way table. The averages 
for B (length of an individual treatment) should also be considered in 
case they indicate a trend effeejt that was not marked enough to attain 
significance. These data appear in table 5,17. 


TABLE 5.17 WErairrs op denervateb mvsci.e, aujitsteu for regression on 
WEIGHT OP normal MUSCLE (unit ~ 0.01 gram) 


B means 

64.4 

62.4 

61.4 
64.7 

d:1.43 


AC two-way table (s.e. ±2.47) 



Number of treatments 

ai uz ao 

MeariJ 


G 

66.0 

59.1 

65.2 

60.1 

Type of 

F 

60.0 

55,8 

57.3 

57.7 

current 

00 

i 63.2 

63.9 

68.6 

65.2 


25 

' 64.5 ' 

71.8 

73 v2 

69.8 

Means 

60.9 

62.6 

66.1 

■63.2 


s.e. ±1.24 


s.e. ±1.43 
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The means have been adjusted in the usual way for the regression 
on the weight of the normal muscle. Because of this adjustment, every 
mean has a slightly different standard error. However, as pointed out 
in section 3.85, it is sufficiently accurate to use an average standard 
error. This is computed from the effective error m.s., which is 48.1 X 
1.019, or 49.0, as described in section 3.85. Thus the standard error for 
an entiy in the two-way table is V'lO.O/S^ or 2.47. 

The 25 cycle alternating current gave a significantly higher mean 
weight than any other type of current. The 60 cj^'cle alternating current 
was superior to both galvanic and faradic currents, the last two not being 
signifiicantly different. The weights increased as the number of treat- 
ment periods daily increased: On inspection the increase appears ap- 
proximately linearly related to the number of treatment periods. 

To test this supposition, we may fit . a regression of the adjusted A 
totals on the number of treatment periods (the adjusted A means do not 
carry enough decimals to check with the analysis of variance). The 
relevant data are shown below. 

Adjusted z ~ number 
A totals of periods 
ax 1950 1 

as 2005 3 £ (a - a )(2 -- g) « 421 .067 

ao 2115 0 == 12.067 

By the usual formula, the contribution of the regression to the sum of 
squares for A is (421.667) V(12*ffff’^) (3^)7 or 439. The last divisor, 32, is 
required because the A totals are totals over 32 rats. The sum of squares 
for regression accounts for practically all the sum of squares for .4 (441), 
so that there is no indication of deviation from linearity. The regression 
coefficient for a single rat is 421.667/(12.667) (32) or 1,0 unit. The con- 
clusion is that, within the range of periods tested, each additional ti*eat- 
ment period per day increased the weight by 1 unit. 

The B means in table 5.17 do not indicate any consistent effect of 
length of an individual treatment period (1 to 5 minutes). The major 
part of the AC interaction seems to come from the anomalous behavior 
of the faradic current, which produced a drop in weight from ai to as 
and ao. Since the interaction as a whole was not significant, it does not 
seem worth while to examine the statistical significance of this effect. 

5.33 Other Designs. The total number of treatment combinations in- 
creases rapidly as the number of factors or the number of levels of a fac- 
tor is increased. In this event the latin square necessitates an amount 
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of replication that is usually impracticable, while it becomes difficult to 
assemble homogeneous replications for a randomized blocks arrange- 
ment. Consequently, the experimental error per unit tends to increase. 
In order to avoid this increase in error, a number of designs have been 
produced. The basic principle, which is common to all these designs, is 
the use of a ^‘block” that is smaller than a complete replication. That is, 
the arrangements are such that the differences among these ''incom- 
plete'' blocks are eliminated from error, just as the differences among 
replicates are eliminated from error in a randomized blocks design. Un- 
fortunately, as will be explained later, this reduction in block size can be 
accomplished only by the deliberate sacrifice of accuracy on certain 
treatment comparisons. The designs fall into three main groups ac- 
cording to the particular treatment comparisons that are sacrificed in 
this way. 

In the jirnt groups the treatment comparisons that are sacrificed are 
the high-order interactions. These designs are appropriate in lines of re- 
seai'ch where experience has shown that high-order interactions are 
nearly always negligible. The designs available in randomized incom- 
plete blocks are described in chapter 6 and those which can be placed in 
latin squares in chapter 8. 

In the semid groupj loiown in agriculture as split-plot experiments, a 
factor, or a group of factors and their interactions, are sacrificed. These 
designs, which are very widely used, are discussed in chapter 7. Finally, 
in certain cases it is possible to construct designs so that the loss in ac- 
curacy is spread evenly over all factors and their interactions. Within 
this group there are several types: balanced lattices (chapter 10), bal- 
anced incomplete blocks (chapter 11) and balanced lattice squares 
(chapter 32). In cases where the effects of the factors and the sizes of the 
interactions arc rather unpredictable, these designs may be suitable. 
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of replication that is usually impracticable, while it becomes difficult to 
assemble homogeneous replications for a randomized blocks arrange- 
ment. Gonsequently, the experimental error per unit tends to increase. 
In order to avoid this increase in error, a number of designs have been 
produced. The basic principle, which is common to all these designs, is 
the use of a ^‘block’^ that is smaller than a complete replication. That is, 
the arrangements are svich that the differences among these '' incom- 
plete^' blocks are eliminated from error, just as the differences among 
replicates are eliminated from error in a randomized blocks design. Un- 
fortunately, as will be explained later, this reduction in block size can be 
accomplished only by the deliberate sacrifice of accuracy on certain 
treatment comparisons. The designs fall into three main groups ac- 
cording to the particular treatment comparisons that are sacrificed in 
this way. 

In the first group^ the treatment comparisons that are sacrificed are 
the high-order interactions. These designs are appropriate in lines of re- 
search where experience has shown that high-order interactions are 
nearly always negligible. The designs available in randomized incom- 
plete blocks are described in chapter 6 and those which can be placed in 
latin squares in chapter 8, 

In the second group, known in agriculture as split-plot experiments, a 
factor, or a group of factors and their interactions, are sacrificed. These 
designs, which are very widely used, are discussed in chapter 7. Finally, 
in certain cases it is possible to construct designs so that the loss in ac- 
curacy is spread evenly over all factors and their interactions. Within 
this gi-oup there are several types: balanced lattices (chapter 10), bal- 
anced incomplete blocks (chapter 11) and balanced lattice squares 
(chapter 12). In cases where the effects of the factors and the sizes of the 
interactions are rather unpredictable, these designs may be suitable. 
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There are tAvo important properties of this plan. The total of blocks 
1, 3, and 5, minus the total of blocks 2, 4, and 6, is the ABC interaction 
total. Thus the ABC interaction is one of the components of the com- 


CHAPTER 6 
CONFOUNDING 


6.1 The Principle of Confounding 

6.11 The 2'^ Factorial Experiment with Complete Confounding. This 
chapter deals with designs where the size of block is reduced by the 
sacrifice of accuracy on certain high-order interactions. The method 
will be illustrated first for a 2^ experiment with three factors, Aj B, and 
0, each at two levels. Since there are only 8 treatment combinations, 
the replication is not particularly large and in practice this experiment 
would most frequently be arranged in ordinary randomized blocks or 
perhaps in an 8 X 8 latin square. The example is chosen because of its 
simplicity. 

The interaction of highest order is the ARC interaction. It will be 
recalled (section 5.23) that this interaction is estimated from the com- 
parison 

(ahc) + (a) + (5) + (c) - (a6) - (ac) - (6c) - (1) 

Suppose that each replicate in the experiment is divided into 2 blocks of 
4 units each, such that one block contains a6c, a, 6, and c, while the other 
contains ab, ac, be, and (1). With 3 replicates the plan (before ran- 
domization) would be as follows. 

TABLE G.l 2’“* Experiment in blocks op 4 units, with ABC confounded 


Rep. I 


Rep. II 


Rep. Ill 


Block 


1 

2 

3 

4 

5,. 

6 

ahc 


ab 


ahc 


ab 


ahc 


ah 

a 




a 


ac 


a ■ 


ac 

h 


he 




be 


[r'.'b',: 


he 

e j 


(1) 1 




: (i) : : 


".'C: 1 


(1) 
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parisons amongst blocks. It is said to be completely confounded with 
blocks. Secondly, the other six factorial effects, A, JS, Cf ABy AG, md 
BC, will all be found to be orthogonal with the block totals. For in- 
stance, the x4jB interaction may be written (apart from the divisor) 

[(a6c) + (c) -- (a) - (5)] + [(a6) + (!) -- (ac) - (te)] 

Of the 4 units in any block, two carry a (+) sign in this expression and 
two carry a (— ) sign. Consequently, if we increase all the observations 
ill a selected block by any amount, say 60, the estimate of AB remains 
unchanged, and similarly for the other 5 factorial effects. This means 
that these 6 factorial effects are not influenced by differences amongst 
blocks. Various phrases are used to describe this property: the effects 
may be said to be miconfounded with blocks, or free from block effects, 
or to be composed entirely of within-block comparisons. 

Thus differences amongst blocks of 4 units are eliminated from the ex- 
perimental errors of the main effects and two-factor interactions, whereas 
with randomized blocks only differences amongst blocks of 8 units are 
eliminated. The reduction in effective block size is attained by making 
ABC the same as one of the block comparisons. There is no within- 
block information available about ABC. 

The degrees of freedom in the analysis of variance separate as follows. 


d.f. 


Blocks 


A, AC, BO 


Error 


Total 

■ ■■" 23.‘ : 


All sums of squares are calculated in the usual way, so that there is no 
complexity ill the computations. 

The composition of the error s.s. is perhaps worth noting. Consider 
the 3 blocks (1, 3, and 5 in table 6,1) that contain the treatments abc, a, 
b, and c. These may be regarded as a randomized blocks experiment 
with 4 treatments and 3 blocks. Consequently the interaction of treat- 
ments with blocks contains six components. Similarly in blocks 2, 4, 
and 6 the interaction of the other set of treatments with blocks provides 
six components. These two sets of six components constitute the error 
for the complete experiment. In other words, the error term in a con- 
founded factorial is made up of interactions between treatments and in- 
complete blocks. This remains true in the more complex designs that 
a,ppear later in this chapter, even though in these cases the composition 
of the error is less easy to detect. 
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ABC does not appear explicitly in the analysis above. Actually, the 
experiment, if properly randomized, provides an estimate of error and a 
test of signihcance for ABC. In practice it is seldom worth while making 
this test, which is usually very insensitive. The test may, however, 
throw additional light on the nature of confounding. Let us ignore all 
other factorial effects and consider table 6.1 as the plan of an e.xperiment 
to determine ylSC, i.e., to determine the difference between the group 
of treatments ahc, a, h, c and the group ab, ac, he, (1). From this point of 
view the block becomes the “experimental unit” and the experiment is 
of the ordinary randomized blocks type, having 2 “treatments” and 3 
replicates. The total s.s. among the 6 “units” is, of coui'se, the blocks s.s. 
in the previous analysis. Thus the blocks s.s. may be subdivided into: 

d.f. 

Replicates 2 

ABC 1 

Error for 2 

Total 5 

If confoimdiiig has been effective, the error for ABC, being composed of 
comparisons among blocks, will be larger than the erra (with 12 d.l) 
which applies to the rest of the experiment. Moreover, the error for 
ABC is estimated from only 2 d.f., so that the test of ABC is a poor one. 
The test might be worth making with a considerable ninnber of experi- 
ments of the same type, vdrere it is desired to examine the average ABC 
effect over the whole group, 

6A2 The 2‘^ Factorial Experiment with Partial Confounding. Any of 
the seven factorial effects may be confounded with blocks in this way. 
The rest will then be free from block effects. These facts enable us to 
spread the confounding in an experiment among several factorial effects. 
A plan of this type is shown in table 6.2. 

TABLE 6.2 2® EXPEitiMJ3NT in blocks of 4 units, with ABCy AC, EC 

FAimALLY CONFOUNDBD 

Eep. I Rep. II Hep. Ill 

Block 1 2 3 4 5 6 
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The difference between the totals of blocks 1 and 2 represents the ABC 
interaction. In replication II, however, the block composition has been 
changed so that the difference between the block totals is the AC inter- 
action. In replication III the difference is the BG interaction. 

With this plan, AyB, (7, and AB are entirely free from, block effects. 
ABC is completely confounded with blocks in replicate I, but in the other 
two replicates the estimate of dJSC is orthogonal with blocks. Thus a 
“within-block'^ estimate of ABC may be obtained from replicates II and 
III, 811^^113?-, a ‘^within-block^^ estimate of AC is available from rep- 
licates I and III, and one of BC from I and IL These three effects are 
partially Goidomided with blocks, since an estimate that is free from 
block effects can still be made for each effect. Each tSvithin-block^^ 
estimate is derived from 2 out of the 3 replicates in the experiment. The 
ratio % serves as a measure of the extent of the confounding. Yates 
(6.1) calls this ratio t}ie I'elative information Qia the confounded inter- 
action, The ratio gives the amount of information available on the 
partially confounded effect, relative to that available on an uncon- 
founded effect. 

In the analysis of variance, all factorial effects appear. 

di. 


Blocks 5 

A ■ ■ 1 

B 1 

C 1 

AB 1 

AC V 

BC' r 

ABC' 1' 

Error 11 

Total 23 


The sums of squares for blocks and for the unconfounded effects d, 
B, Cj and AB are found in the usual way. The sum of squares for AC is 
calculated only from replicates I and III, the divisor for [ACf being 16 
instead of 24, Corresponding rules appty to BC and ABC, In the anaty- 
sis of variance, the primes are inserted as a reminder that these effects 
are partially confounded and require special calculations. 

6.13 Confounding in the 2^ Series. In order to use confounded de- 
signs intelligently, it is not necessary to understand in detail the methods 
by which they are constructed. The rules for the construction of con- 
founded 2^ factorials will be briefly described, however, since they are 
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fairly simple and may help to explain the choice of plans presented at 
the end of this chapter. 

If a replicate is to be divided into 2 blocks, any factorial eflect can be 
confounded with blocks. In most cases we would confound the highest- 
order interaction in all replicates. Suppose that each incomplete block 
is to be further divided into 2, so that there will be 4 blocks (I - • • IV) 
in a replicate. Having chosen one factorial effect to represent the com- 
parison (I + 11 — III - IV), we may easily verify that any other fac- 
torial effect may be made to represent the comparison (I — II + III — 
IV), That is, to arrange a 2’" factorial in blocks of 2^“^ units, we ma^^ 
confoimd any two factorial effects that we choose. 

The remaining block comparison (I — II — III + IV) represents a 
third factorial effect, which is also confounded with blocks. Barnard 
(6.2) has sho\vn that this effect is uniquely determined by the two ef- 
fects that were chosen. It is always their “generalized interaction,” and 
is found by combining all the letters that appear in the two chosen fac- 
torial effects, and cancelling all letters that enter twice. Thus, if a 2^ 
factorial is to be confounded in blocks of 2*'^, or 8, units, we might be in- 
clined to choose ABCDE and BCDE as the first two factorial effects to 
be confoimded. The generalized interaction of these t^vo effects is 
ABBCCDDEE or, after cancellation, is A. Consequently, if these two 
effects are chosen for conf oimding, the main effect of- A is also automati- 
cally confounded. These results may be summarized as follows. “If a 
replicate of a 2^ factorial is arranged in blocks of units, three fac- 
torial effects are confounded with blocks. Of these, two be chosen 
at will: the third is their generalized interaction.” 

To proceed, a plan with 8 blocks in the replicate can be obtained from 
one with 4 blocks in the replicate by dividing each block into two parts. 
In making this division we may confound any factorial effect, except one 
of the three that is already confounded. Thus, in the 2^ example, where 
ABCDE ^ BCDE, and A were confounded in the division into 4 blocks, 
we might choose say ACE for the division into 8 blocks. But with 8 
blocks in the replicate, there are seven components of the comparisons 
among blocks, and each of these must repi'esent one factorial effect. 
Hence, in addition to the four factorial effects that are known to be con- 
founded, three others are confounded. These three are the generalized 
interactions of ACE with ABCDE, BCDE and A, or BD, ABD, and CE, 
respectively. 

The general rule should be clear from this case. Suppose that a rep- 
licate in a 2^ factorial is to be divided into blocks of 2^^^^ units, so that 
there will be 2* blocks in the replicate. Then we may select any k fac- 
torial effects to be confounded, subject only to the restriction that none 
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of these must be a generalized interaction of any group of the others. A 
further (2^^ — /c -- 1) effects are automatically confounded. These are 
all the effects which can be expressed as generalized interactions of the 
group of effects selected for confounding. 

In the construction of designs for practical use, it follows that the ef- 
fects to be confounded must not be selected without examination of the 
other effects that are automatically confounded. If confounding is to be 
restricted to high-order interactions, we cannot choose and 

BCDE in a 2^ factorial, since A would then be confounded. In order to 
avoid confounding any main effects or first-order interactions in this case, 
a little trial will show that the only possible choice is two second-order 
interactions and one third-order interaction: for instance Ai?C, ADE, 
and their generalized intei'action BCDE. The plans at the end of this 
chapter contain what appear to be the best choices for general use. 

When the interactions to be confounded have been selected, there re- 
mains the problem of writing out the plan showing the treatment com- 
binations that appear in each block. This may be done by the procedure 
used in the present discussion. The replicate is first divided into two by 
confoimding one effect, then into four by confounding a second, and so 
on. The rules which express each factorial effect in terms of the original 
treatment combinations (section 5.24) decide the composition of the 
blocks at each stage. For experiments with a large number of factors, 
arranged in blocks of small size, Fisher (6.3) has given an alternative 
method that is more expeditious [see also Finney (6.4)]. In this method 
a group of 2^^“^" letter combinations is constructed, such that the gen- 
eralized interaction of this group with any treatment combination gives 
the members of the block that contains this treatment combination. 

Fisher (6.3) has examined the general problem of eonfoundiiig when 
the object is to keep all main effects and first-order interactions com- 
pletely clear of block effects. He has shovm that experiments up to the 
2^ factorial (128 treatment combinations) may be placed in blocks of S 
units, while blocks of 16 units can accommodate experiments up to the 
2^® factorial (32,768 treatment combinations). 

The preceding discussion applied to a single replication. With more 
than one replicate we may repeat the first replicate, in which case the 
confounding is complete for the interactions involved. Alteimativdy, 
the effects that are confounded may be changed from one replicate to 
the next, so that some intra-block information is available on all effects. 

6.14 Example of a 2^ Factorial Confounded in Blocks of 8 Units, 
Table 6.3 contains the plan and yields of a 2^ field experiment on beans 
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TABLE 6.3 Pi-an and yibldes (beans in pounds) op a 2^ pactoeial 

EXPERIMENT 


Rep. I 


Rep. II 



p h d npk 


npk d p dnk 

Block a 

46 55 63 36 

Block a 

43 42 39 34 

375 

dnh dnp dpk n 

351 

n dnp k dpk 


41 48 65 42 


47 52 50 44 



i dp nk dk pk \ 

1 

nk dp (1) np 

Block h 

50 44 43 51 ! 

Block 6 

43 52 57 39 

381 

dnpk (1) dn np 

395 

pk dk dnpk dn 


44 58 41 50 


56 52 64 42 


conducted by the Rothamsted Experimental Station in 1936. The fac- 
tors were 

"Dung (D): none, 10 tons per acre. 

Nitrochalk (N): none, 0.4 c\vt. N per acre. 

Superphosphate (P) : none, 0.6 cwt. P2O5 per acre. 

Muriate of potash (J£) : none, 1.0 cwt. K2O per acre. 

With 16 treatment combinations in blocks of 8 plots, only one factorial 
effect is confounded in each replication. From the plan it will be seen 
that DNPKj the highest-order interaction, was confounded in each of 
the two replicates. The computations proceed as follows. 

Step 1, Calculate the totals for each treatment combination, the 
block and replicate totals, and the grand total The total s.s. and the 
sums of squares for replicates and for blocks within replicates should 
present no difficulty. 

Step 2, This deals with the computation of the factorial effect totals. 
Write down each factorial effect total as a linear function of the treat- 
ment combination totals, as shown in table 6.4. The treatment com- 
binations and their total yields are placed in a systematic order in the 
first two columns, and the remaining columns give the factorial effects. 
Although the writing of the table requires some time, the procedure is 
quite simple. When the columns for D and N have been written down, 
the column for DN is obtained as the product of corresponding signs in 
the D and N columns. Similarly, later in the table DNP may be ob- 
tained as the product of the signs of D and WP, or of N and DP, and so 
on. Thus it is necessary to know only the expressions for the main ef- 
fects. 

The factorial effect totals are shown at the bottom of each column. 


TABLE 6.4 CALctnuATioN of factorial effect totals in a 2^ experiment 
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The total for DiV’PiT is not requ^ since owing to the confounding 
DNPK will not appear explicitly in the analysis of variance. The DNPK 
column, however, allows a check to be made, as follows. If the 15 totals 
are added, it may be verified by inspection of table 6.4 that their simi, 
+66, should equal 16 times the {dnpk} treatment total, minus the grand 
total, i.e., 

16(98)-- 1502 - +66 

Yates (6.1) gives a different method for obtaining the factorial effect 
totals. His method is completely automatic and does not require the 
expressions for the factorial effects to be written down. The present 
method, though probably not so speedy as Yateses method, is more 
flexible in that only those factorial effects that are of interest need be 
obtained. Sometimes the experimenter will wish to isolate only the 
main effects and two-factor interactions, previous experience having 
shown that interactions of higher order are likely to be negligible. 

Step S, The contribution of each factorial effect to the treatments 
s.s. in the analysis of variance is now obtained. Since there are 32 plots, 
the square of each effect is divided by 32. The analysis of variance ap- 
pears in table 6.5. Note that DNPK is omitted, since it already appears 

TABLE 6.5 Analysis OF VARIANCE 


61.6 



d.f. 

BS, ■ ■ 

Replications 

' 'i '■ 

3.1 

Blocks ill reps. 

2 

123.2 

D 

1 

2.0 

N 

1 

325.1^** 

P 

1 

6.1 

K 

i 

4.5 

DN 

1 

32.0 

DP 

1 

242.0** 

NP 

1 

78.1 

DK 

1 

6.1 

NK 

1 

32.0 

PK 

1 

24.5 

DNP 

1 

2.0 

DNK 

1 

10.1 

DPR 

1 

15.1 

NPK 

1 

32.0 

Error 

14 

340.0 

Total 

31 

1277.9 


24.29 


in the blocks s.s. As a check, the total of the 15 treatment s.s., 811.6, 
plus that for DNPI[y 78,1, should equal the treatments s.s. as found in 
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the ordinary way from the totals for the 16 treatment combinations. 
The error s.s. is found by subtraction. 

Only two effects are significant: the main effect of nitrochalk, which 
produced a depression of the yield, and the DP interaction. Since 
neither D nor P produced significant main effects, it is advisable not to 
lay much stress on the DP interaction in the absence of other confirm- 
atory evidence. 

Step 4- The best presentation of the results will depend on tlieir 
nature and on the audience for whom they are intended. Table 6.6 gives 

TABLE 6.6 Differential responses (pounds per plot) 


llesponse with 


Factor 

Mean 

response 

Dung (D) 

Nitrochalk {N) 

Super. (P) 

Mur. pot. (iiT) 



Abs. 

Pres, 

Abs. 

1 

Pre.s. 

i 

Abs. 

Pres, 

Abs. 

Pres. 

Dung (2)) 
Niti’ochalk (iV) 
Super, (P) 

-0.5 

-6.4 

-j-0.9 

-8.4 

-4.6 

-4.4 i 
+6.4 

-2.. 5 

-2..2 

+1.5 

+4.0 

--6.0 

-9.5 

+5.0 

-3.2 

+0.4 

-4.4 

-0.9 

-1.4 

-8.4 

+2.6 

Mur. pot. {K) 

-0.8 

+0.1 

-1,6 

+1.2 

-2.8 

-2.5 

+1.0 




s.e.; =i:2.46 for differential response; d=1.74 for mean responses. 


a concise presentation of the main effects and two-factor interactions in a 
form that has been used frequently at Rothamsted Experimental Sta- 
tion. The response to each factor is sho\\m separately for each level of 
every other factor. Thus the row labelled ^‘Dung” contains the mean 
response and the differential responses to dung. The figure —2.5 (re- 
sponse to dung with iiitrochalk absent) is the average response to dung 
over all plots that did not receive nitrochalk. The table enables a quick 
appraisal to be made of the nature of the main effects and tAVo-f actor 
interactions. In the present case, of course, the table is scarcely needed 
because of the dearth of effects. 

The data in the table are easily calculated from the factorial effect 
totals in table 0.4. For instance. 


Mean response to dung = 



■ 8 .': "'V' 

-0.5 

Mean response to dung 


- [DN]] - 



(nitrochalk absent) = 

~ :;1 6',. U 

:+2.5 

Mean response to dung 

iVlP] 

+ [DN]} = 



(nitrochalk present) = 

-+u- 

+ 1.5 
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These relations may be verified from the signs in the columns in table 

6.4. 

The standard error per plot is V24.29, or 4.93, Since each differential 
response in table 6.6 is the difference between the means of two groups 
of 8 plots, its standard error is (4,93)/2, or 2.46. The standard error for 

a mean response is times this value, or 1.74. 

6.16 Confounding of a 3^ Factorial. Let the symbol ij denote the 
treatment combination that has the fth level of A and the jth level of 
B {i, j = 0, 1, 2). As before, we wish to keep the main effects clear of 
block effects, that is, to confound only the AB interaction. Since there 
are 9 treatment combinations in a replicate and since 3 is the only factor 
of 9, the size of the incomplete block must be 3 units. 

We have seen (section 6.26) that the main effects of A and B both con- 
tain two components, while the AB interaction has four components. 
Further, the main effects and the interaction can be divided into single 
components in various ways, and the division that is most appropriate 
for interpretation of the results will change from experiment to experi- 
ment. This suggests that the particular components of the interaction 
which we should desire to confound will vary from case to case. Actu- 
ally, the possibilities for confounding are much more restricted with the 
3^ than with the 2^ system: in fact, only one set of components of AB 
lends itself readily to confounding. 

The confounding is based on the properties of the 3X3 graeco-latin 
square. Suppose that the 9 treatment combinations are set out as in table 
6.7, on which a 3 X 3 square is superimposed. Following the notation 

TABIjE 6.7 UsB OF A 3 X 3 geaeco-latin squabe to obtain the AB 

INTERACTION 



h 

h 

h 

tto 

(00)Ji/i 

(01)/®/2 

(02)73/3 

at 

(10)13/2 

(ll)/l/3 

(12)72/1 


(20)/2/3 

(21)/a/i 

(22)7i/2 


used by Yates (6.1), we denote the latin letters in the square by /x, / 2 j 
and Is, respectively, and the greek letters by Ji, J 2 , and Js, respectively. 

It is clear that comparisons among the row totals of the square give 
the two components of the main effect of A, while comparisons among 
the column totals give the main effect of B, Now consider the I totals: 

Ji ~ (00) + (11) + (22); I 2 ^ (01) + (12) + (20); 

Is = (02) + (10) + (21) 
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By the basic property of the latin square, compariBons among these 
totals are orthogonal to both rows and columns: that is, to the main ef- 
fects of A and B. Hence, the comparisons among the I totals must 
represent two of the four components of the AB interaction. The same 
argument shows that the two remaining components oi AB are derived 
from the comparisons among the J totals, where 

/i- (00) + (12) + (21); Jg- (01) + (10) + (22); 

Jz - (02) + (11) + (20) 

A numerical verification of this fact was made at the end of section 5.26, 
where the sum of squares for AB was calculated from the latin and 
greek letter totals which correspond to the / and J totals. 

The application of this result to confounding is shown in table 6.8, 
There are two possibilities. In plan (i) the 1 components of AB are com- 
pletely confounded with incomplete blocks, since the block totals have 
been made the same as the I totals. The main effects and the J compo- 
nents of AB are unconfounded. In plan (ii), the J components of AB are 


TABLE 6.8 3X3 Experiment with AB partialuy confounded 


Plan (i) 

Incomplete blocks 


(i) 

(ii) 

(iii) 

(00) 

(01) 

(02) 

(11) 

(12) 

(10) 

(22) 

(20) 

(21) 


I 2 

h 

1 components confounded 


Plan (ii) 

Incomplete blocks 


(i) 

(ii) 

(iii) 

(00) 

(01) 

(02) 

(12) 

(10) 

(11) 

(21) 

1 (22) 

(20) 

Jx 



J components confounded 


completely confounded. It should be noted that neither the I nor the 
J components are easy to interpret in terms of the results of an actual ex- 
periment. They are selected for confoimding because they are con- 
venient for this purpose. 

If the number of replicates in an experiment is even, we may use plan 
(i) in half the replicates and plan (ii) in the other half. With this ar- 
rangement the I components may be estimated (clear of block effects) 
from those replicates in which plan (ii) is used, and vice versa for the / 
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components. The relative accuracy is for each set of components. 
It may be shovTi that the relative accuracy is also for ant/ component 
of 4 B in which we may be interested. The confounding is said to be 
balanced with respect to AB. Balanced confounding is preferable, for 
there is no reason to confound I more heavily than J. 

This design has limited practical utility. It may be used where the 
factors are known to operate independently, so that will be neg- 
ligible. If information about AB is wanted, a randomized blocks design 
is advisable unless there will be a substantial reduction in error variance 
from the use of incomplete blocks. The same principle of construction, 
however, leads to a very useful design for the 3^ factorial. 

6.16 Confounding of a 3^ Factorial. The same notation will be used. 
Thus, (021) denotes the treatment combination ao& 2 C.i‘ Since there are 
27 treatment combinations, the possible sizes of incomplete block are 
3 and 9 units. The plan for blocks of 3 units necessitates confounding of 
the two-factor interactions and is not given here. With 9 units per 
block, only ABC need be confounded. In the case of the 3^ factorial we 
were able to divide the 9 treatment combinations into groups of three 
(the I and J groups), such that comparisons among the group totals 
gave the components of AB. We shall show that the 27 treatment com- 
binations in a 3^ factorial can be divided into groups of nine such that 
comparisons among the group totals give the components of ABC. Each 
set of three groups will contribute t\vo components of ABC. Since ABC 
has in all eight components or degrees of freedom, there will be four such 
sets,-'"'.. ■ ■ 

The AB interaction can be calculated separately for each level of C. 
As before, we will obtain the AB interaction from its I and J compo- 
nents, Table 6.9 shows the I components for each level of C. 


TABLE 6.9 The I components op AB shown por each le-vose op C 



Co 

Cl 

■ 

h 

(000) + (110) + (220) 

(001) + (111) + (221) 

(002) + (112) + (222) 

h 

(010) + (120) + (200) 

(Oil) + (121) + (201) 

(012) + (122) + (202) 

h j 

(020) + (100) + (210) 

(021) + (101) + (211) 

(022) + a02) + (212) 


We may regard table 6.9 as a 3 X 3 table, in which each entry is the 
total of 3 treatment combinations. The row totals of the table give the J 
components of AB, while the column totals give the main effect of C. 
Further, we may take I and J totals from this table just as in table 6.7. 
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These totals will be called J — Ii, J — Ji, etc., since they come from the 
J components of Thus 

1 - h: (000) + (110) + (220) + (Oil) + (121) -f (201) -f (022) + (102) -f (212) 

I - hi (001) + (111) + (221) + (012) + (122) -f (202) (020) 4- (100) + (210) 

1 - h: (002) + (112) -f (222) + (010) -f (120) -j- (200) H- (021) + (101) + (211) 

By the same argimient as for the 3 X 3 design, the comparisons among 

these totals of 9 treatment combinations must represent two of the com-** 
ponents of the interaction of AB with C ; that is, of the ABC interaction. 

Consequently if we put all treatment combinations in I l i into the 
first block, those in I — I 2 into the second, and those in I — J 3 into the 
third, we have a plan which completely confomids two of the eight com- 
ponents of ABCj and leaves all other factorial effects unconfounded. 
This arrangement appears as replication III in plan 6.7 for the 3^ fac- 
torial at the end of this chapter. 

The second set of three groups of 9 treatment combinations is obtained 
by taking the J totals from table 6.9. 

I -Jn ( 000 ) + ( 110 ) + ( 220 ) + ( 012 ) + ( 122 ) ( 202 ) 4 ( 021 ) + ( 101 ) + ( 211 ) 

I - Jr. (001) 4“ (111) 4- (221) 4 (010) 4- (120) 4- (200) 4- (022) -f (102) + (212) 

I - ^3: (002) 4 (112) 4- (222) 4 (011) 4 (121) 4 (201) 4 (020) 4 (100) 4 (210) 

This grouping constitutes replication IV in plan 6.7. The remaining 
sets are obtained by forming a 3 X 3 table similar to table 6.9 for the J 
components of AB at each level of C. The reader may verify that the 
J — Ji, J I 2 , and J — Z 3 groups are as shown in replicate I and the 
J — Jiy J r- and / — Jg groups in replicate II of plan 6.7. 

6.17 Example of a 3^ Factorial Confotinded in Blocks of 9 Units. 
This experiment, conducted by the Seed Laboratory, Iowa State Col- 
lege^ tests the effects of 3 levels of nitrogen, 3 of phosphoriivS, and 3 of 
potash on the germination of lettuce seedlings. The seed was thoroughly 
mixed, and divided into 108 samples of about 60 seeds each. Each 
sample was planted in a copper box, 6 in. square and deep, in a 

mixture of soil and sand. The boxes were placed in a germinator, at a 
temperature of about 32 °C. At the end of 5 to 7 days the seedlings 
were classified as normal, abnormal, hard, or dead. The data in table 
6.10 show the numbers of normal lettuce plants. 

There were 4 replications, each placed on a different shelf in the ger- 
minator. On a shelf the boxes were placed in 3 columns of 9 boxes each, 
each column being an incomplete block. It will be noted that all 3 fer- 
tilizers had a deleterious effect on emergence. 



0.17 


CONFOUNDING 




693 



No. 

COCOCDCOCMCMCDOS’^ 
(IISl pH pH y-i rH rH rH T-^ 

Hhi 

■CO 

1 rH 


No. 

0 ■ - ■ 

OOOy-irHCMCMCMtH 

ScMOOlrHCSlOrHO 

Or-iOlOCMCMOrHrH 
+■ ++ 



c 

npk 

No. 


251 

d 

(V 

■6 

05 . 

% 

S? 

g ^ CM O ^ rH e CM 

rHOCMCMCMOrHOrH 

i + 


P4 

Pq 

d 

CD pH tH p rH CO lO 00 

CO CO CO CO ^ CO CM 50 CO 

308 


No. 


PHOCMr-OO^CMrHO 

CM CM O O O CM rH rH rH, 
rH CM rH 05 O O CM O rH 

1 1 i 





xo 

CM 

rH 


CM 

00 

o 

o 

CJ5 

pH 

Tfi 

tH 

CM 

XO 

CM 

CO 

iO 

CM 

CM 

rH 

CM 

o 

o 


o 

(M 

rH 

CM 

o 


pH 

Q 

tH 

CM 

o 

CM 

CM 

CM 

o 

CM 

o 

tH 

pH 

rH 

o 

+ 




1 






t^co«:o500i>-oot"i(N 

T}^COrH(^^(^^C<^COCO(rO 


O C<i r-H <M 1-H (M o T-i O 

OOfMrHO(N(M?-iT-i 


•i-H- 


+ 


t>*rHTiHji5C5500lOt^CO 

CCiCO^<MCO(NCOCO<M 


<NrHO{MOrH(Mr-HO 
(M r~* O ^ p O 
t-<r-lOOT-lCMgMOC>^ 


! 33 O 0 OOtfO‘^N-r-fii 0 C<l 
Tfi TjH UD U^I lO CO CO 


^ cq r~< r*< o CM <M O O 
rHOCMQt-^i^CMCMO 
T-i CM O O CM O tH CM 


+ 


OCMCMQ 

OOCMCM 

t-^OCMO 


O CM O rH o t-h CM v-j CM 
CMCMOOr-fi-HOC^r-^ 
CMrHt~ 4 CMOi^OOCM 


C55i0001>-00000ic 

rHCMrWCMCM^COCMCM 


O tM w O 05 ^ CM CM C 
CM r~H CM rH CM o r-i o C 
O CM t-H rH CM O O rH c 


-f- 


-f 


197 232 233 662 302 290 311 903 

The use of the + and — signs is discussed later in this section. Grand total 3177 


TABLE 6.11 Additional summary totals 


6.17 3® FACTORIAL CONFOUNDED IN BLOCKS OP 9 UNITS 


2 ^ 










H 


cj : 


a, -H 


s 

c3 

+3 

'<f 

. o. 

b 

H 


3177 

919 

2258 

Total 

t ■ ■ 

In CO 

rH «0 

CO 

2484 

! 

1 Total 

3177 

662 

2515 

Total 

r— 4 


11-0 ^ 

iH 

Cf5 00 

"■0 >, 

CO 05 

>. 

CO o 

Tt* CO 

tH O 

CO 

O 

r-l CO 

t> CN 

05 T-t 

00 CO 

OCNl 

00 Ttl 

t-H 




r-i 


cq tH 


COrH 

PI 

COiM 

iH . 

o »o 

RQ 

t-M »0 

CD cq 

j> CO 

ii cq 

rH CO 

{M 


00 ^ 

O <M 

c6 ^ 

tH 


rH 


rH 



■ CO 

OiQO 

tH 

vOI>. 

CO 

rHt>- 

I> . '«d 

rH O 

H -r^. 

<N Ci 


O rH 

1> 

rHCO 

CO eo ^ 

O r-I 

00 



tH 


rH 1 



oi 


^ A 
cu & 
o 

ii 

9 9 


I 

I 


<D 

s s. 

p g 




. <D Ph 

43 S 

P i 

2 2 



I 1188 
1058 
931 

1138 

1044 

995 

fc . . ■ ., 

rH 

CO 

CO^ iO 
' C0’'?ti O5' ■ 
tH O 05 
rH rH 

■tH’. 

CO 

(M 

Si' 

«D.rH.<N 

1 .CN'.CS rH. 

- C0 <M CO 

lO' MOq ' ■ 
CS A^ 05 ■ 
C^COCVI 

929 : 

■ , 04 i 

PL 

C<J to 
rH CO 00 
CO CO CSJ 

■ tH^ 

.05' 

H 

' ' 

CO 00 CO 
,tH W01> ■ 
CO <N 

iH'COAO 
CO iH to 
CO CO CO 

1049 

H 

PL 

0 05 O) 
OOCOCO 1 
: CO CO CO 

1058 

■o 

S'. 

449 

409 

341 

(NO5Q0 

CO.QOhH 

COCO 

' Oi ' ■ 

. 

rH 
tH , 

"o 

0> tH rH 
■THtNtN, 
CO CO 

j 1188 


! 

SSh 

' , o' . 'H (N ' 

''i 

OH C 4 



169 


From treat, totals 1104 991 1082 3177 

Prom rep. 4 302 290 311 903 


170 6.17 

The computations pi’oceed as follows. 

Stef 1. Form the block and replicate totals and the grand total 
(shown in table 6.10) and also the totals for each treatment combination 
and the three two-way tables (shown in table 6.11). 

Step 2. These data enable us to calculate the total s.s. and the sum 
of squares for replications, blocks rvithin replications, and for the N, 
P, K, NP, NK, and PK factorial effects. All are obtained in the usual 
way and are entered in the preliminary analysis of variance (table 6.12). 


TABLE 6J2 Analysis of variance for 

A 3 X 3 X 

3 FACTORIAL 


di. 

s.s. 

m.s. 

Ileplications 

3 

2,041.88 


Blocks within replications 

8 

5,008.15 

626.02 

N 

2 

1,016.67 

508.34 ** 

P 

2 

917.39 

458.70 ** 

K 

2 

293.39 

146.70 

NP 

4 

399.27 

99.82 

NK 

4 

589.61 

147.40 

PK 

4 

212.89 

53.22 

NPK: confoundod in replications 




1 

2' 

25.21 

12.60 

2 ■ 

2' 

64.22 

32.11 

3'^ ' 

r 

6.39 

3.20 

4' '■ 

2' 

198.30 

99.15 

Error 

70 

4,146.88 

69.24 

Total 

107 

14,920.25 


Subdivision of part 

of the treatments s.s. 


N: L 

1 

1,012.50 

1,012.50 

Q 

1 

4.17 

4.17 

P: L 

1 

917.35 

917.35 

Q 

1 

0,04 

0.04 

K: L 

1 

284.01 

284.01 * 

Q 

1 

0.37 

9.37 

NP: LXL 

1 

184.08 

184.08 

■Best' ■ 

3 

215.19 

71.73 

NK: LXL 

1 

256.69 

256.69=^ 

Itest 

3 

332,92 

110.97 

NPK: LX LXL 

r 

59.12 

59.12 


Notice that we do not calculate the total s.s. for treatments (with 26 
d.f.), since this contains part of the blocks s.s. 
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Step 3, There remains the calculation of the contribution from the 
NPK interactions. Before doing this, it may be remarked that some- 
times, from the nature of the factors or from previous experience, there 
is good reason to believe that the three-factor interactions will be neg- 
ligible. In this case the experimenter may decide to pool the sum of 
squares ior NPK with the error s.s, without troubling to compute the 
sum of squares for NPK, The pooled error would have 78 di. and would 
be obtained by subtraction of the sum of squares calculated in step 2 
from the total s.s. The danger from this procedure is that if three- 
factor interactions are present the experimenter may not detect them, 
and moreover his estimate of error will be inflated. For many types 
of field experimentation in agriculture the procedure seems reasonably 
safe. When there is doubt it is better to isolate NPK. 

Consider the two components of NPK that are confounded in rep- 
lication 1. The contribution of these components to the sum of squares 
for NPK must be calculated from the remaining 3 replicates, in which 
they are unconfounded with blocks. The totals needed are shown in 
table 6.11 under the heading ^^NPK components.’^ From the treat- 
ment totals, compute the total (944) of the 9 treatment combinations 
that appear in block Iri-. Thus 

Block lA = 012 + 122 + 220 + 202 + 101 + 021 + 000 + 110 + 211 
944 = 119 + 81 + 92 + 96 + 82 + 99 + 171 + 118 + 86 

In the same way we obtain 1102 for the total over all treatments that 
appear in block IB and 1131 for block 1C. Underneath these figures 
we place the respective totals for blocks L4, li?, and 1C. By subtraction 
we obtain the totals 773, 748, and 737. These are the totals of the groups 
of treatment combinations taken over replications 2, 3, and 4. The sum 
of squares of deviations of these quantities from their mean is divided 
by 27, since each figure contains 27 observations. The result, 25.21, is 
the contribution of the two components of NPK to the sum of squares 
for NPK, The remaining six components are found similarly from rep- 
lications 2, 3, and 4. Of course, all eight components could be computed 
in one step as 

-^[(773)^+(748)H ‘ ‘ • + (701)2+ (771)2] ~^[(225S)H . , . + (2274)2] 

The error s.s., with 70 di., is calculated by subtraction. 

Step 4, In an experiment of this t 3 npe, w+ere each factor has equally 
spaced levels of an ingredient, it is usualty advisable to examine the linear 
and quadratic components of the response curves. The contributions 
to the treatments s.s. are displayed in the lower section of table 6.12, 
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The method of caleulotioii desoribed in section 5.26, '"'h^e dnta 
taTand P in Ms eiperiment wem used as an example ^ All toee fee- 
show significant linear responses, with no mdrcation of any de. 

^rd“ too-factor interactions, the “linear by Un»r'' com- 
ponents have been isolated for IVP and NK (s» section 6.26 for the 
method). For PlCit was not thought worth while to do this, since the 
total s s for PK, 212.89, is not large enough to allow any single compo- 
nent to be significant. The "linear by linear” component is significant 

for iVif but not for iVF. o,, a r w • * +• 

Where the “linear by linear” components of the two-factor interactions 

are large, it is desirable to isolate the "linear by linear by linear” compo- 
nent of the three-factor interaction. This will be done as an exercise. 
First we must define this component. With two factors, say N and P, 

the JViPx, component has been defined as 

(22) -b (00) - (20) - (02) 

where the numbers refer to the levels of JV and P. Idie value of this 
quantity at the highest level of K, nvinus its value at the lowest level of 
K, measures the linear effect of K on N lPl, and is the NlPlPl compo- 
nent in question. Algebraically, it is 

L = (222) 4- (200) + (020) + (002) - (022) - (202) - (220) - (000) 

The estimate from the treatment totals will be found to be --27. 

Since three-factor interactions are partially confounded with blocks, 
this component is also partially confomided and must be adjusted so as 
to remove block effects. The nature of the confounding is seen by miting 
down the expression above in the original table of plot yields (table 6.10). 
This has been done by means of the -j- and — signs that appear to the 
left of the seedling numbers in the table. It is evident that L is orthog- 
onal to the totals of blocks 1C, 2B, 3C, and 4C, since each of these 
blocks has one H- and one — • In blocks IP, 2C, 3P, and 4P, P has three 
-|- signs, while in the remaining blocks it has three signs. Hence, a 

quantity that is free from block effects is 

3L - (IP) - (2C) - (3P) - (4P) -f (U) -4- (2A) + (3d) + (4d) 


3 (-27) - 364 - 134 - 232 - 290 + 171 -b 308 -b 197 -b 302 = -113 
the block symbols denoting block totals. This expression, apart from a 
divisor, may be shown to be the least squares estimate of NiPiM-l, ad- 
justed for block differences. 

If this quantity is expressed algebraically as a linear fimction of the 
plot yields, the sum of squares of the coefficients is found to be 216. The 
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contribution of NlPlKl to the treatments s.s. is therefore (113) 6, 

or 59.12, which is practically the same as the error m.s. If the number 
of replicates differs from foiirj the procedure for isolating the '^linear by 
linear by linear” component remains the same, except that the divisor for 
the square becomes 54r instead of 216. 

Step 5. This concerns the presentation of the results. Usually it 
will be sufficient to show the three two-way tables of means, which are 
derivable from the two-way tables of totals (in table 6.11) on division 
by 12. Since all main effects and two-factor interactions are imcon- 
founded, standard errors and jS-tests for the 3X3 tables are obtained 
Just as in a randomized blocks experiment. The principal results are 
that each fertilizer has produced a significant decrease in the numbers 
of seedlings that emerged, the decrease being substantially proportional 
to the amount of dressing. The significant NlKi, interaction represents 
the fact that the decrease in emergence from no to ^2 was smaller at the 
/c 2 level than at the Jcq level. There is some indication of a similar effect 
with iV and P, though this did not attain significance. 

The individual 27 treatment totals or means cannot be used as they 
stand for interpretative purposes, since they contain some block effects. 
A table of these totals or means will probably be unnecessary unless some 
aspect of the three-factor interaction requires study. To obtain such a 
table, we adjust each total so as to remove block effects. Each block 
effect is first estimated. For this purpose we do not use simply the ob- 
served block mean, since that in turn contains treatment effects. The 
least squares estimate of any block effect is 

^-[4 (block total) — (total of treatments appearing in the block)] 

The basic data needed are available in the section headed com- 

ponents” in table 6.11. Thus, for block L4, the estimated effect is 

w[4(l71) - (944)] - --9.6 

The block effects are given below. 


Block 


Replication 

A 

B 

C 

1 

-9.6 

+11.6 

+16.5 

2 

■f4.2 

-4.0 

-16.1 

3 

-8.8 

-5.4 


4 

+3.9 

+6.3 

+6.0 


To adjust any treatment total, we note from the plan the 4 blocks in 
which it appears, and compute the sum of the effects for these 4 blocks. 
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This quantity is BuUracted from the unadjusted total to give the ad- 
justed total. Thus for nipilcQ, which appears in blocks L4, 2^, ZB, and 
4-jS, the adjusted total is 

118 [-9.6 + 4.2 - 5.4 + 6.3] - 122.5 

To obtain the adjusted mean we divide by 4 as usual. 

6.18 Mixed Series: the 3 X 3 X 2 Factorial. In the designs discussed 
in previous sections, all factors have the same niunber of levels. When 
the number of levels differs from one factor to another, the utility of 
confounding is limited. Usually some two-factor interactions must he 
confounded, and tlie computations become more laborious. Only five 
designs of this type are given at the end of this chapter, though a number 
of others are available in the literature. As an example, we will consider 
the confounding of a 3 X 3 X 2 factorial, vhich has 18 treatment com- 
binations. 

The main effects of a factor will be kept clear of block effects if every 
block contains each level of the factor the same munber of times. Thus 
the factor A, which occurs at 3 levels (0, 1, 2), is imconfoimded if every 
block contains an equal number of O’s, Us, and 2's. Consequently, block 
size has to be a multiple of 3. Similarly we can keep the main effects of 
B (0, 1, 2) imconfoimded if the block size is a multiple of 3. For the 
factor C at 2 levels (0, 1), the block size must be a multiple of 2. Hence, 
in order to keep all main effects clear of blocks, the only feasible block 
size is 6. 

Further, mth 6 units in a block, every possible combination of the 
levels of A and those of C can appear once in every block, so that we may 
expect to be able to keep AC, and likewise BC, imconfounded. We can- 
not place all the 9 combinations for AB in a block, so that AB will be 
partially confounded. 

From this approach the plan can be constructed rather easily. Every 
block is to contain the 6 combinations (00), (10), (20), (01), (11), (21) 
of B and C. The only question is the manner in which the 3 levels of A 
are combined with the 6 pairs above. This allocation is to be such that 
all 6 combinations of AC appear in the block. That is, the 0, 1, and 2 
levels of A must each appea.r with the 0 level of C and each with the 1 
level of C. We may impose one additional rule, designed to confound 
AB as little as possible. Although we cannot represent all 9 combina- 
tions of AB in a block, it will be well to represent as many as possible. 
Thus we introduce the restriction that any AB combination (e.g., 12) 
must not appear more than once in a block. 

Under these rules only four types of replicate can be made up, as shown 
in table 6.13. 
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eonsider, for instance, in how many ways the bb containing (000) 
can be constructed. The first three A' levels (reading from the top in 
table 6.13) must be either 0, 1, 2 or 0, 2, 1, so that each A level shall ap- 
pear with the 0 level of G, If 0, 1, 2 is chosen, the remaining three A 


TABLE 6.13 Possible blocks for a 3 X 3 X 2 factorial 




1 . Level of 

B 

C 

la 

lb 

Ic 

Ila 

II& 

lie 

^ IIIo 

UJIr 

11 Ic 

IVa 

IV6 

IVc . 

0 

0 

1 

2 

0 

2 

0 

1 

i 1 

2 

0 

1 

2 

0 

1 

1 

0 

2 

0 

1 

0 

1 

2 

0 

1 

2 

1 

2 

0 

2 

0 

0 

1 

2 

1 

2 

0 

2 

0 

1 

0 

1 

2 

0 

1 

2 

0 

1 

1 

2 

0 

2 

0 

1 

1 

2 

0 

1 

1 

0 

1 

2 

2 

0 

1 

1 

2 

0 

0 

1 

■ 2 , 

2 

1 

1 

2 

0 

0 

1 

2 

0 

1 

2 1 

.2 

0 

1 


levels must be either 1, 2, 0 or 2, 0, 1, these being the only *sequences that 
make no AB combination appear twice. These sequences give blocks 
Ic and II5 respectively. Similarly, the choice of 0, 2, 1 for the first three 
A’s leads only to blocks IIIc and IV6. 

Ill the same way, we find that there are only four possible blocks con- 
taining the treatment combination (001) and four containing (002.) 
The three sets of 4 blocks can be grouped into 4 separate replications 
(I, II, III, and IV in table 6.13). 

As Yates (6,1) has shown, the plan can be mitten in a more con- 
densed form (table 6.14) which exhibits the nature of the confounding. 


TABLE 6.14 Condensed poem of the plan for a 3 X 3 X 2 confounded 

, factorial ' ■ : 


Level 

. . ■ ■ ■'Blocks' ■ . ■ ■■ 

of C 




1, ' .■■■■ 


.! 

I ■ ■■ ■' 







■ .la^. 

Ih 


Ila 

Ih 

He 

Ilia 

llli, 

I1I« 

IVa 

IVft 

IV« 


h : 

h 

h 

.A;' 

h 

■JM 

Ts 

A 


A 

Ji 

J2 


h 

h 


h 

h 



Ji 

Jz 


Js 



For example, in block the 3 combinations that appear with the 
zero level of C are (01), (12), and (20). This is the group that forms the 
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1% component of AB (see section 6.15), It follows that in the first 2 
replicates the I component ofvlB is partially confounded* the J compo- 
nent can be verified to be imconfounded. In replicates III and IV the 
/ component is partially confounded, while the I component is clear of 
blocks. .4 JSC is partially confounded in all replicates. The relative in- 
formation on has been shown to be % and that on ABC, 

6.19 Numerical Example of the 3 X 3 X 2 Design. Table 6.15 shows 
the plan and analysis of variance for an experiment on the response of 
young timg trees to fertilizers. The results of this experiment have been 
reported by Merrill, Kilby, and Greer (6.17) , The general object was to 

TABLE 6.15 Data for 3 X 3 X 2 factorial experiment in blocks of 6 units 

Ila 115 lie 
200:136 000 : 43 100 : 89 

010:56 110: 81 210: 87 

120: 64 220: 90 020: 66 

101 : 95 201: 81 001: 91 

211: 76 Oil: 61 111: 97 

021: 71 121: 65 221: 60 


498 421 490 1,409 

iva m m 

200 : 88 000 : 53 100 : 81 
110:107 210: 66 ' 010: 58 
020: 70 120: 92 220: 56 
101: 79 201: 88 001: 90 
011:92 111:109 211:68 
221: 96 021: 95 121: 67 


632 503 420 1,455 

Analysis of variance 
Source 

Replicafions 

Blocks witliin replications 
Levels of 8-8-6 in row (A) 

Meals (R) 

Levels of 8-8-6 side-dressed (0) 

8-8-6 in row X meals (AR) 

8-8-6 in row X 8-8-6 side-dressed (AC) 

Meals by 8-8-6 side-dressed (BC) 

8-8-6 in row X meals X 8-8-6 side-dressed (ABC) 

Error 


d.f. 

s.s. 

m.s. 

3 

3,837 

1,279 

8 

2,836 

354 

2 

1,116 

558 

2 

254 

127 

1 

868 

868* 

4' 

1,129 

282 

2 

2,995 

1,498** 

2 

424 

212 

4' 

1,016 

254 

43 

8,909 

207 

71 

23,384 



la. 


15 


Ic 

100 

80 

200 

00 

000 

38 

210 

86 

010 

55 

no 

73 

020 

70 

120 

82 

220 

75 

201 

74 

001 

67 

101 

78 

on 

82 

111 

67 

211 

51 

121 

86 

221 

57 

021 

66 


478 


406 


381 


Ilia 

III5 

lllc 

100: 

86 

200: 

73 

000: 

66 

010: 

79 

110: 

76 

210; 

85 

220: 

73 

020: 

97 

120: 

101 

201: 

97 

001: 

116 

101: 

117 

111: 

79 

211: 

86 

on: 

106 

021: 

113 

121: 

81 

221: 

102 


527 


529 


577 


Total 
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discover the type of fertilizer application that would stimulate the early 
growth so that the young trees would be ready for transplanting to com- 
mercial orchards by early autumn. The data shown are the mean heights 
in centimeters of the 12 timg trees on each plot. For convenience in 
following the computations, the plot yields follow the same order as in 
table 6.13. Note that factor /I appears last in table 6.13 and first in 
table 6.16. 

The three factoi\s were as follows, all dressings being per acre. 

R C 

8“8“6 fertiliser applied 
as side-dressing 

0 None 0 None 

1 Tung meal (650 lb.) 1 200 lb. 

2 Cottonseed meal (650 lb.) 

The vside-dressing was placed in a shallow furrow on one side of the 
row about 8 inches from the tiing seedlings. Factors A and B were ap- 
plied at planting (March 13) , factor C on June 3, and the height measure- 
ments were made on October 23. It should be noted that all three fac- 
tors supply each of the common plant nutrients (iV, P, and K), 

In an experiment of this type, one might anticipate that two-factor 
interactions will be px^esent. If, for example, 400 lb. of fertilizer applied 
in the row supply all the nutrients that the young tree can utilize, plots 
that receive this dressing will show no response to either factor B or Cl 
On the other hand, plots that receive the zero level of A may respond to 
the dressings supplied in factors B and C. It would not be wise to con- 
found any two-factor interactions heavily. Since, however, the soils 
available for experiments had been found to be very variable, it was de- 
cided to arrange the experiment in blocks of 6 plots. As we have seen, 
this will partially confound AS, though as the relative information is 
high {J4)} iiot much information is lost on this comparison even if the 
incomplete blocks prove ineffective. 

Table 6.16 shows the computation sheet leading to the anabasis of 
variance. The steps are as follows. 

i. Calculate the block totals (denoted by la, I&, etc,, and shown in 
table 6.15), and the treatment totals (table 6.16). Calculate the total 
s.s. and the blocks s.s. The latter has been divided into two com- 
ponents — ^replications (3 d.f.) and blocks within replications (8 di.). 
This separation enables us to estimate the decrease in error variance due 
to the reduction in block size from 18 to 6 plots. The analysis of variance 
appears at the foot of table 6.15. 

ii. Form the two three-way totals as shown. From the margins of 
the table for {ci + Cq), calculate the sum of squares for the main effects 


A 

8-8-6 fertilizer applied 
in the row 

0 None 

1 200 1b. 

2 400 1b. 
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TABLE 6.10 Computation sheet for 3 X3 X2 experiment, blocks of 

6 units 

(i) Treatment totals 


1 

CO 

Cl 


ho 

h 

h 

h 

h 

h 

«0 

200 

248 

303 

364 

341 

345 

ai 

336 

337 

339 

369 

352 

299 

«2 

375 

324 

294 

340 

281 

315 


(ii) Two-way tables 




a 4 

CO 

, 

Cl — Co 


h 

t>i 

h 

Totals : 

60 

hi 

h 

Totals 


564(1) 

580(4) 

648(7) 

1,801 

+164(1) 

+93(4) 

+42(7) 

299 


705(2) 

080(5) 

688(8) 

2,032 1 

+ 33(2) 

+ 16(6) 

-40(8) 

8 

a:2 

715(3) 

605(0) 

609(9) 

1,929 : 

- 35(3) 

-43(6) 

+21(0) 

-57 

Totals 

1 ,984 

1,883 

1,895 

5,762 

162 

65 

23 

250 


(ill) From (ci -f- ro) 

h - (1) + (5) -f (9) - 1,862 

h - (2) + (6) + (7) - 1,958 

h - (3) + (4) 4- (8) - 1,942 


t/i “ (1) 4 (6) 4 (8) = 1 ,807 
J2 - (2) 4(4) 4(9) - 1,903 
Jz - (3) 4 (5) 4 (7) - 2,052 


5,762 

2/3^ - 2Ji 4 I« 4 IL - 4,700 
2 / 2 ' - 2/2 4 I/; 4 11/, - 4,743 
2/3' - 2/3 4 L 4 IL 4,755 


5,762 

2 / 1 ' = 2Ji 4 III« 4 IVa - 4,673 

2 / 2 ' - 2/2 4 Illft 4 IVb - 4,838 

2Jn' - 2/3 4 HI. 4 IVc - 5,101 


for AB 


14,198 

(4,700)2 4 (4,743)2 4- ■ 


• 4 (5,101)^ 


14,612 

(14,198)2 4 (14,612)2 


From (ci — Co) 
4^1= 4200, E 2 - 
where, e.g., 


= 1,129 


432, Rz = 418; >81 - 481, ^'2 = 4147, /Sg == 422 

Ri = (1) 4 (5) 4 (9), • - Sz - (3) 4 (5) 4 (7) 

2/?i' - 2Ri I& 4 I. 4 II& - II. - 4306 
2 R 2 ' - 2 R 2 4 In -Tc- lla 4 lie = 4153 
2Ih^ - 2i?3 - la 4- Ib 4 Ila - Ilb = 4 41 

4600 

- 2Si - Illb 4 HI. 4 IVb - IVc - 4293 
« 28^2 4 Ilia - HI. - IV„ 4 IV. - 4132 
2^83 ~ IIL, 4 III& 4 IV« - IVb - 4 75 


(4306)^ 4 (4153)^ 4 • * • 4 (475)^ 


4500 

(4500)‘^ 


1.016 


60 


90 
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of A and B. From the total of the table for (ci — Co), we calculate the 
sum of squares for the main effect of C: this is (250)^72/ since there are 
72 plots. Similarly, the margins of the table provide the sum of squares 
for the AC and BC interactions, which being unconfounded are obtained 
in the usual way. 

iii. These steps lead to the sum of squares for AB. First calculate the 
I and / components of ilB, which are obtained by summation from the 
(ci + Co) table in step ii, as shown in table 6.16 (iii). Then apply the ad- 
justments for block effects. Since Ji, for instance, does not appear in 
blocks la or Ila, the adjustments involve the totals for these two blocks. 
The adjustments lead to the quantities 2Ji', etc., from wFioh the sum of 
squares for follows as shown, 

iv. These steps lead to the sum of squares for ABC. From the table 
for {ci — Co), step ii, form totals Ri and which are calculated by the 
same rides as for the /’s and /’s, respectively. These totals are then ad- 
justed for block effects, yielding quantities 2/2i', 2;Si', etc., from which 
the ABC s.s. is easily obtained. The error s.s. is then computed by sub- 
traction. 

V. Summary and presentation of results. Consider first the con- 
clusions indicated by the analysis of variance. The only significant mean 
square for a main effect is that for C (side-dressing). The AC interaction 
is significant at the 1% level, while the mean square for d., though not 
significant, is substantially above the error m.s. Neither B (meals) nor 
any of its interactions approaches significance. These results suggest 
that the AC two-way table should be examined. 

Since table 6.17 contains no confounded effects, the means per plot 
are found simply by addition and division from table 6.16, (i). It is 


TABLE 6.17 AC two^-way table 


Row dressing 

i Mean heights (centimeters) ± 4.15 

Side dressing 

■v.co:'. 

'';^;>:None, ' \ ' 

/ Cl'-. 

uq None 

62.6 


ai 2001b. 

84.3 

85.0 

az 400 lb. 

82.8 

78.0 
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evident that there are large and significant responses to the 200 lb. dress- 
ings, both when applied in the row and as side-dressing. The means, 
84.3 for the row application and 87,5 for the side-dressing, do not differ 
significantly, so that the applications appear to have been about equally 
effective. There is no further increase to the 400 lb. row dressing, or to 
mixtures containing both types of dressmg. Thus the large AC inter- 
action arises primarily because each dressing is effective only in the 
absence of the other. 

The analysis of variance would indicate that no definite effects of the 
meals (B) can be established. As an exercise, the AB two-way table, 
which involves partially confounded effects, will be examined. The 
means must be adjusted for block effects. This could be done by first 
calculating a mean for each incomplete block, adjusted for treatment 
effects, and then adjusting each AB mean for the 4 blocks in which it 
lies. It is slightly quicker to perform the adjustments by means of the 
I and J effects, as follows. 

In table 6.16, (iii), divide the quantities 2Ji', etc., by 42, and calculate 
the deviations from their mean. The quantities 2Ji, etc., are treated 
similarly. The same procedure is carried out for the J’s and J’s, except 
that the divisor is 24. This gives 


ii' = -0.8, 


0.2, 


0.5 


-4.7, 

jY = -0.8, 

is' - 

6.4 

ii « --2.4, 


l.C, 

k = 

0.9 

ji ^ 

-4.7, 

1 

O 

is == 

5.5 

Hi - -fl.6, 

— 

-1.4, 

din “ 

-0,4 


0.0, 

sh = -0.1, 

A/s =* 

-0.1 


The unadjusted mean of (ao6o) is 70.5. From table 6.16, (ii) and (iii), 
we see that (ooi>o) belongs to Ii and Ji. Hence its adjusted value is 


(a-ol-o)' - (ao&o) + Hx + tji = 70.5 -h 1.6 -f 0.0 = 72.1 

By the same principle, the adjustment to {(h^x), for instance, is 
(Si 2 -}- iji). The complete two-way table is shown below. 

TABLE 6.18 AB two-way table, adtosted eob block epebcts 


E,ow dressing 

Mean heights (centimeters) 

h 

None 

h 

Tung 

b2 

Cottonseed 

ao None 

72.1 

73.1 

79.5 

ai 200 lb. 

86.6 

87.6 

79.4 

az 400 ib. 

88.9 

74,2 

77,6 
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The standard error of a single figure in the table is V207/8, or 5.09, 
since each figure is the mean of 8 plots. This standard error applies to 
^tests of unconfounded effects; i.e., of the main effects of and B. For 
ig-tests of components of the AB interaction, the figure must be multiplied 
by VS/7, or 1.069, the value 8/7 being the inverse of the relative elR- 
ciency on AB, 

For ^-testspf quantities that are a mixture of main effects and inter- 
actions, the multiplier lies between 1.069 and 1 . It will usually be 
sufficiently accurate to use the multiplier 1.069. If desired, the correct 
standard error may be obtained by means of a rule which will be given 
without proof. The quantity whose standard error is w^aiited is expressed 
in terms of components of main effects and interactions. Suppose that 
we wish to test the response to cottonseed when no row fertilizer is 
applied. The response is (79.5 — 72.1), or 7.4 cm. The approximate 
standard error is 5.09 X 1.069 X \/2, or 7.69. To obtain the correct 
value, we write symbolically, 

(63 bo)(% + ^1 + <^ 2 ) , (f>2 ^o)(2^o — (Z2) 

0)2 - ho)ao = — + — 

If all means are unadjusted means, this is an algebraic identity. The 
first term on the right is a component of the main effect of wdiile the 
second is a component of the AB interaction. Further, by a property of 
least squares, the equation remains valid if means adjusted for block ef- 
fects are substituted for unadjusted means. By the rule for the variance 
of a linear function, the variance of the first term on the right is 6cr^/9, 
where is the variance of a single figure. By the same rule, the variance 
of the second term is 



the factor 8/7 being introduced because the second term is a component 
of riB. The two terms are orthogonal, so that the estimated standard 
error of (62 ~~ ?>o)^o is 

5.09 Vf + (-V-)(f) = 6.09 V2, 190 = 7.53 

xAdjustment of the 18 individual treatment means is more tedious. In 
table 6.16, (iv), we divide the quantities 2Ri\ etc., by 30, and the quan- 
tities Rif etc., by 24, thereafter taking deviations from the mean of each 
trio. This gives 


n' = +4.6, 

11 

1 

P 

rs' » —4.2 

«i' = + 4 . 2 , 


- 1 . 2 , 


-3.1 

n = +4.8, 


rg =K* —2.7 

Si —0.1, 


+2-0, 

S3 - 

-2.6 

6ri = -0.2, 

Sr2 ~ Hhl*7j 

Srg ~ —1.5 

5si =■ +4.3, 

5^2 — 

-3.8, 


-0.5 



Ill this experiment there is little point in adjusting all 18 treatment 
means. However, the unadjusted treatment totals, table 6.16, (i), sug- 
gest that there might be an effect of the meals (factor B) when applied 
alone. Hence it may be of interest to test the difference between the 
adjusted means of ^062^0 and aoMo- 

From table 6.16, (iii) and (iv), we see that aohco belongs to ii, ji, n, 
and $1, Since the unadjusted mean is 50.0, the adjusted mean is 

50.0 “4“ 5^1 + 8ji — = 

50.0 4- 1.6 + 0.0 + 0.2 - 4.3 = 47.5 cm. 

Note that the terms in 8r and ds carry a minus sign whenever the treat- 
ment is at the co level, and a plus sign when the treatment is at the Ci 
level. For a{)h 2 Co, which belongs to ?‘2, ^*3? ^'2; S3, the adjusted mean 

will be found to be 73.1 cm. 

In order to obtain the standard error of the difference (73.1 47.5) 

= 25.6, this must be expressed in terms of components of main effects 
and interactions. This requires some practice, but the method is not 
hard to master. Consider the following table of values of {1)2 — 5o), 
averaged over the 4 replicates. 


, ■ ' ■ 1 

(1)2 — 
ao at 

h) 

ua 

\ Total 

Co 

d 

e 

/ 

i 1 

Cl 

9 

h 

k 

m 

Total 

n 

0 

V 

9 


42^- 

ABC 


’h-m 


The quantity whose standard error we wish is d. Now it is easy to 
that 

= g+ (I — w) + i2n — 0 — p) + [(2<i — c —f) • 
components of B BC AB 

As before, this is an identity for the unadjusted means and remains valid 
for adjusted means. Hence by the same rule as before 

O 




12+ 12 + 24(^5) + 24 I 


22.4(jtr2 


where cr^ is the estimated variance per plot. The extra factor 8/7 is used 
the AB component and the factor 8/6 for the ABC component, for 
which the relative information is 5/8. Consequently, the estimated 
standard error of d is 
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= 11.4 cm. 
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It follows that the ^-value for the response to cottonseed when applied 
alone is 25.6/11.4, or 2.25, wdth 43 d.f,, which is significant at the 5% 
level. 

These results illustrate a feature that is common to the mixed series of 
confounded factorials. If the examination of the results is directed 
mostly at simple rather than factorial effects, the analysis is likely to be 
time-consuming because of the adjustments needed to remove block ef- 
fects and to construct standard errors by the method given above. It 
will also be apparent that considerable experience in the analysis of 
variance, or access to expert advice on this topic, is advisable. 

6.2 The Use of Confounded Designs 

6.21 General Recommendations. In order to make efficient use of 
confounding, it is necessary to consider whether the advantage is likely 
to outweigh the disadvantages. 

The advantage comes from the reduction in the experimental error 
by the use of a block wvhich is more homogeneous, or which can be sub- 
jected to a more unffoi’m technique, than the complete replicate. With- 
out some idea of the amount of this reduction, a realistic decision cannot 
be made on the question of confounding. If uniformity data have been 
collected, the experimenter may compare the variability within the in- 
complete blocks with the variability within replicates. In addition, 
from the results of an experiment in which confounding has been em- 
ployed, it is usually possible to estimate what the experimental error 
■would have been if confounding had not been used. The method of cal- 
culation, which was first outlined by Yates (6,5, p. 215), is given in sec- 
tion 6.35. In many types of research the reduction in error from con- 
founding varies greatly from experiment to experiment. In such cases 
little reliance can be placed on the results of a single trial; a suiamaiy of 
a moderately large group of experiments is necessary. 

The disadvantages of confoimding consist of (1) the reduction in 
replication on the confounded treatment comparisons, and (2) in some 
cases, a greater complexity in the calculations. With regard to the first, 
the experimenter should take into account the importance of the inter- 
actions that must be confounded and the extent to which they are con- 
founded. No interaction should be completely confounded unless there 
is good reason to believe, either from previous experience or from the 
nature of the factors to be tested, that the interaction w411 be negligible. 

As an illustration of the type of reasoning that might be employed in 
reaching a decision, consider the 3 X 3 X 2 design discussed in the pre- 
vious section. The incomplete blocks contain 6 units, with 7/8 relative 
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information on AB and 5/8 relative information on ABC, ABC is ex- 
pected to be negligible 1 however, is of sufficient interest so that con- 
founding is undesirable if it involves any loss of precision for this com- 
parison. The issue resolves itself into the question: is the error variance 
per unit with blocks of 6 units likely to be less than 7/8 of the error vari- 
ance with blocks of IS units? If so, the AB interaction is estimated more 
precisely from the confounded design than from randomized Idocks, be- 
cause the reduction in variance per unit more than compensates for the 
loss of replication due to confounding. Since the relative information is 
useful for thi>s type of argument, it is given in table 6.19 for all the plans 
presented at the end of this chapter. 

The increase in the amount of computation varies both with the type 
of design and with the type of analysis needed to summarize the results. 
Wliere all the confounding is mnpkte, the statistical analysis is prac- 
tically the same as with randomized blocks. With all factors at two or 
three levels (i.e., in the 2’^ and 3^' series), the extra computations are not 
formidable even when partial confounding is used; they are iniicli more 
troublesome when the number of levels is not the same for all factors, 
as in the 3 X 2^ and 3X2^ designs. The extra work consists in the cal- 
culation of adjustments to the partially confounded comparisons so that 
their sums of squares in the analysis of variance are fz*eed from block 
effects. If it is desired to present tables of mean yields which involve 
confounded comparisons, the entries in the tables must bo adjusted so 
as to remove block effects. This point should be remembered, especially 
with designs where two-factor interactions are confoimded. If j^-tests of 
simple effects are needed, Kspecial computations must be made to obtain 
the standard errors for these tests, Moreover, if part of the data is miss- 
ing, the analysis tends to become much more complicated than with 
randomized blocks. 

The experimenter who has not previously used confounding and who 
does not have access to expert statistical advice is advised to confine him- 
self at first to the simplest types of confounding. He should make sure 
that he understamls the method of analysis hefore the experimmt is mn- 
ducted. After the experiment is completed, he should investigate the in- 
ci*ease in precision that has been achieved by confounding, as a guide to 
the future use of the technique. 

Yates (6.6) has pointed out that in certain circumstances a confounded 
design might give rise to a misleading interpretation of the results. This 
can be illustrated by means of the 2^ factorial in table 6.1 (p. 154). This 
has three replicates, with ABC completely confounded in each replicate. 
Suppose that, the response to A is much larger than any other effects, 
and that this response varies greatly from block to block. That is, the 
interaction of A with blocks is much larger than the interactions of the 
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other factors with blocks. Now the factorial effect total for BC may be 
written 

BC = (abc) + (be) + (a) + (1) --- (ah) - (oc) (h) - (c) 

- [(abc) + (a) ^ (b) ^ (c)] - [(ab) + (ac) - (he) -- (1)] 

== response to A in blocks 1, 3, 5 ~ response to A in blocks 2^ 4, 6 

The point is that the factorial effect of BC is a component of the inter-- 
action of A with blocks. When tested against the error ni.s. (12 d,f.) 
this response might be significant^ not because there is an effect of BC, 
but becaiise ^4 has large interactions with blocks. 

Another way of describing the source of the trouble is to say that the 
error m.s. is not homogeneous, in that the interaction of A with blocks 
is larger than the interactions of other treatments. If this were sus- 
pected, the danger of a misleading interpretation could be avoided by 
subdividing the error m.s. We can isolate 4 d.f. (two from blocks 1, 3, 
and 5 and two from blocks 2, 4, and 6) that represent the interaction of 
A with blocks. The mean square for these four could be used to test A 
and BC, while the remaining factorial effects would be tested by the other 
8 di . for error. This device can usually be applied in the 2^ series, though 
generally not in the mixed series. Further, the heterogeneity of the 
error variance may not be suspected. 

How frequently this danger arises will depend on the nature of the 
experimentation. Investigations by Yates (6.5) and Kempthorne (6.9) 
suggest that there is little danger with fertili:zer experiments on English 
farm crops. In lines of research where errors have often been found to be 
heterogeneous, one should be on the alert for this effect. 

In some experiments the use of confounding is practical^ imavoidable/ 
For instance, an experiment on methods of mamifacturing ice cream, 
planned at North Carolina State College, contained 3 factors each at 3 
levels. It w^as desired to discover whether ordinary consumers coulci 
detect differences in the palatability of the 27 different types of ice cream* 
From previous experience it was believed that a taster (who was not an 
expert judge) could compare at most 3 ice creams at any single test* 
Moreover, it did not seem feasible to ask the tasters to gi'ade the pal- 
atabilities on any kind of standard scale: they could merely be ask^d to 
rank the 3 ice creams in order of preference, if they had any preferepcesw 
Consequently, a design for a 3^ factorial in blocks of 3 units was re- 
quired, even though this involved confounding of two-factor interactions. 
The design adopted also employed partial confounding of the main ef- 

I fects, so as to obtain approximately equal accuracy on main effects and 
two-factor interactions. 
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TABLE 6.19 fNmx to plans op factorial experiments confounded in 

RANDOMIZED INCOMPLETE BLOCKS 

a. Designs with which any number of replicates may be used 


Num- Num- 
ber of her of 
treat- units per 


Type 

meats 

block 

Interactions confounded in a single replicate * 

Plan 

2* 

S 

4 

ABC 

6.1 

2'^ 

16 

8 

ABCn 

6.2 

2^ 

16 

4 

AB, ACT), BCD 

6.4 

2^ 

32 

8 

ABC, ADE, BCDE 

6.5 


64 

16 

ABCD, ABEF, CDEF 

6.3 

26 

64 

8 

ABC, CDE, ADF, BEF, ABDE, BCDF, ACEF 

6.6 

36 

27 

9 

ABCiH) 

6,7 


81 

9 

ABC, ABD, ACD, BCD, all Oi) 

6.8 

42 

16 

4 

AB {%) 

6,12 

4 X 22 

16 

8 

ABC (H) 

6.13 


* The fractions in parentheses give the relative information on the comparisons 
wMch are confounded Where no fraction is given, the comparison is comjileteiy 
confounded. 

h. Balanced designs 

Number of 
replicates 

Number Units fora Interactions confounded and 



of treat- 

per 

balanced 

relative information (in paren- 


Type 

ments 

block 

design 

theses) * 

Plan 

2 ^ 

16 

4 

6 n t 

All two-factor ah three-fac- 

tor(}4) 

6.4 

26 

32 

8 

bn 

All thrcG-factor {%)\ ah four-fac- 
tor (f4) 

6.5 

2 “ 

64 

8 

lOn 

All three-factor (fi); all four-fac- 
tor {j4) 

6.6 

3* 

27 

9 

4r 

All three-factor (?^) 

6.7 

3* 

81 

9 

4n 

All three-factor 

6.8 

3X2* 

12 

6 

3r t 

BC {%), ABC (.%) 

6.9 

3 X 2 * 

24 

6 

3n t 

BC, BD, CD, all (?^); ABC, ABD, 
ACD, all {%) 

6.10 

3*X2 

18 

6 

4n 

AB (H), ABC m 

0.11 

4* 

16 

4 

Zn 

AB 0^ 

6.12 

4X2= 

16 

8 

Z 71 

ABC VM) 

6.13 

4X3X2 

24 

12 

9n t 

AC -'iBC i^H^) 

6.14 


* The factors ABC * * * are read from the left; thus the BC interaction in a 3 X 2'-^ 
design is the interaction between the 2 factors at 2 levels. 

t The symbol denotes that the number of replicates should be a multiple of 6. 
J In these cases only the balanced design is recommended. 
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6.22 Index to Experimental Plans, The opportunities for confound- 
ing vary with the number of factors and with the number of levels of each 
factor. The plans given at the end of this chapter are those that are 
likely to be most frequently used. 

Table 6. 19, which forms an index to the plans, is divided into two parts. 
The first part shows a group of designs in which any number of replicates 
may be used. In successive replicates, the experimenter may either (i) 
repeat the first replicate (with a new randomization), in which case the 
same comparisons are confounded in all replicates, or (ii) change the 
grouping into blocks in successive replicates, so as to spread the con- 
founding over a greater number of comparisons. In the plans, the fac- 
torial effects that are confounded are shown separately for each replica- 
tion. Consequently, the reader can readily see what possibilities are 
open in this direction. Where confounding can be restricted to a high- 
order interaction, as in the 2'^ desigir in blocks of 8 units, the first pro- 
cedure is preferable. On the other hand, Avith the 2^ design in blocks of 
4 units, Avhere one two-factor interaction must be confounded in every 
replicate, the confounding should be changed in successive replications, 
unless it is certain that one of the two-factor interactions will be neg- 
ligible. More detailed recommendations for the individual designs are 
given in section 6.3. 

The second part, table 6.196, shows a group of balanced designs, in 
which all interactions of the same order and the same S-i'® con- 
founded to the same extent. Thus in the 3 X 2 X 2 X 2 (or 3 X 2^) 
design, all first-order interactions between the factors Avhich have 2 levels 
are equally confounded, the relative information being 8/9. The first- 
order interactions between the factor at 3 levels and any factor at 2 levels 
are unconfounded. Balanced designs are useful in cases where all inter- 
actions of the same order and type are of equal interest, so that it is un- 
desirable to confound some more fully than others. Also, the computa- 
tions are usually simpler. For this reason, only the balanced designs 
should be used in certain cases, as indicated in table 6.19. On the other 
hand, with balanced design^s the available munbers of replications are 
severely limited. 

Table 6.20 shows a number of designs for which plans are given in the 
references cited. These designs are mainly for mixed series, and will not 
be discussed here. 

6.23 Randomization. The procedure is the same for all designs. 

1. Arrange the positions of the blocks at random within each replica- 
tion, using a new randomization for each replication. 



Blocks 3 

Main effects 6 

Two-factor interactions 10 

Error (from high-order interactions) 13 

Total 31 
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2. Allot the treatments in any block at random to the units in the 

block, using a new randomization for each block. 

TABLE 6.20 Conitotinded designs fob otheb eactobiad experiments 


Type 
32 X 2^ 
S'* X2 
X2 

4 X 32 

42 X 2 
4- X3 

43 

4 X 2^ 
4* 

52 

5 X 22 


Units 

pel- 

block 

12 

18 

6 

12 

16 

12 

16 

8 

16 

5 

10 

25 


Number of 
replicates 

2n 

2ti 

2n 

2n 

Any 

371 

Any 

3n 

Any 

4n 

5n 

Any 


Interactions confounded Reference 

AB m, AB€D 6.6 

ABC\ABCD 6.6 

AB, AC, BC (Vs), ABD, ACD, BCD, ABC 6.1 


BC(ys),ABC 

ABC 

AB{^%^),ABC 

ABC 

ABC, ABD, ACD 
ABC, ABD, ACD, BCD 

AB{%) 

BC (^H^), ABC 
ABC 


6.6 

6.6 

6.6 

6.7 

6.6 

6.7 

6.8 
6.6 

6.8 


6,24 Experiments in Single Replication. In most of the plans at the 
end of this chapter, the total number of treatment combinations is large. 
This fact limits the number of replications that can be employed. Some- 
times the resources are sufficient for only a single replication unless the 
experimenter omits some factors which he originally intended to include. 

With only oaie replication, we cannot derive an estimate of error fr<3m 
the interactions of treatments with blocks. If, however, certain high- 
order interactions are negligible, their mean squares in the anatysis of 
variance will behave exactly like components of the error m.s., and there- 
can be used to provide an estimate of error. In the 2^^ series, the 
smallest design for which a single replicate is likely to be used is the 2’^ in 
blocks of 8 units (plan 6.5). In replicate 1 of this plan, ABCf ADE, and 
BCDE are completely confounded. If the remaining three-, four-, and 
five-factor interactionvS furnish tlie estimate of error, the analysis of vari- 
ance appears as follows. 

di. 


m 
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I Other designs ill which a sufficient number of error d.f . are available i 

! from interactions among three or more factors are the 2 ®, the 3 ^ , and the s 

i 3"^. The interactions that constitute the estimate of error should be ; 

i chosen before the results have been inspected, Eecommendations for ; 

I each of the designs above are given in the notes on the plans (section I 

6.3) . It may sometimes be wise to deviate from these recommendations. 

I For instance, in the 2^ example above, the main effects of A, C, and D ^ 

I and the AC and dD interactions might all turn out to be rather large. ■ 

j This would suggest that the ACD interaction may not be negligiblej and 

I that it should not be included in the interactions used as error. It will i 

be realized that a decision of this type should be made before examining ;; 

ACD itself. The practice of examining the high-order interactions and 3 

using those that are small as error leads to a serious underestimation of 
the true error variance. 

The estimation of error from high-order interactions is, of course, open | 

to criticism. If some of these interactions happen to be large, the error 
m.s, that is used will overeBtimatc the true error variance and the fact 
that the interactions are large will not be discovered. Where a consid- 
erable series of experiments of the same general type is being conducted, 
some safeguard is obtained both by examining the high-order interactions 
in experiments that are replicated and by watching the two-factor inter- ; 

actions in experiments with single replication. If most two-factor inter- 
actions are small, it seems veiy unlikely (though still possible) that inter- 
actions of higher order will be lai’ge. On the other hand, if many two- 
factor interactions ai^e found to be large, this suggests that some three- ■ 

factor interactions may also be substantial. For further discussion, see ! 

Yates (6.5) and Cornish (6.10). ; 

> 6,25 Experiments with Fractional Replication. More recently several J 

writers, Finney (6.11) and (6.12), Plackett and Burman (6.13), and 1 

Kempthorne (6.14) have considered the use of only part of a replication ; 

in a factorial experiment. Not enough experience has accumulated to 
I ; permit appraisal of the utility of these designs, and only an introduction i 

to them will be given. They are most likely to prove useful where (i) a J 

j large number of factors is tested, so that even a single replication is not • 

; feasible, and (ii) all high-order interactions are known with confidence to 1 

be negligible. ; 

; In ordei‘ to see what happens wffien the experiment contains only part \ 

i of a replication, consider a 2^ factorial in which only the 4 treatment 3 

: combinations a, &, 0 , and abc are tested. This is half of a complete rep- | 

^ licate. It is clear that the three-factor interaction ABC cannot be esti- , I J 

mated at all, since this represents the difference between the *4 treatments 

iii:i::3ll3i::iM^ 
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that are tested and 4 that ha,ve not been tested. Finney (0.12) has called 
this comparison the defming contrasty since it defines the two halves into 
which the replicate is divided. Further, the main effect of .1, as esti- 
mated from the difference between units with a and units without a; i.e., 

{ahc) + (a) - (6) - (c) 


is exactly the same comparison as the BC interaction, as estimated from 
the difference between units that receive an even nimiber of or c’s and 
those that receive an odd number. Similarly, the estimate of B is the 
same as that of AC, while C is the same as A B. Factorial effects that are 
represented by the same treatment comparison are called aliases; thus 
BC is an alias of A, It has been showm (0.12) that the alias of any effect 
is its generalized interaction with the defining contrast. For example, 
the interaction of A wdth ABC is A^BC or BC, This result makes it 
easy to discover the alias of any factorial effect. 

If the experiment shows an apparent effect of A , there is no wmy of 
telling from the results wdiether the effect w’^as due to A, or to the BC 
interaction, or to a mixture of the two. This type of information is too 
vague to be satisfactory. If, however, it can be taken for granted that 
all two-factor interactions are negligible, the ambiguity disappears, since 
w’-e wmuld conclude that an apparent effect of A w^as in fact due to A, 
With these assumptions, the half-replicate enables us to estimate the 
three main effects. 

This example brings out the fact that in order to obtain definite in- 
formation, it is necessary to assume that certain interactions are non- 
existent. The designs discussed by Finney and Kempthorne are adapted 
for situations where all interactions among three or more factors are 
negligible. In those presented by Plackett and Burman, even tW'O-factor 
interactions are ignored. 

We will examine the construction of a half-replicate of the 2^ factorial, 
since this is probably the smallest member of the 2” series for which 
fractional replication might be needed in practice. It is desired to obtain 
information on both main effects and two-factor interactions. First 
divide the treatment combinations into two sets of 32 by means of the 
defining contrast ABODE F, Either set may be chosen as the half- 
replicate. The factorial effects are shown in table 6.21. 

It will be noted that, rather fortunately, every alias of a main effect 
or two-factor interaction is a high-order interaction of a type that is as- 
sumed non-existent. The alias of a three-factor interaction is another 
three-factor interaction. Since all three-factor interactions are assumed 
negligible, these components may be used to estimate error. Conse- 
quently, this plan, if set out as a completely randomized design in 32 
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units, enables all main effects and two-factor interactions to be estimated 
and provides 10 d.f. from three-factor interactions for the estimation of 
error. 

Since a completely randomized design is rather inaccurate for many 
types of experiment, an arrangement in blocks of 16 or 8 units may be 
preferable. If the 32 units are divided into two groups of 16, one fac- 


TABLE 6.21 Structure of a 2** FA<;'l'ORIAL (hALF-REPUTCATE) 


Main effects 

Alias 

Main effects 

Alias 

A 

BCDEF 

D 

A BCEF 

B 

ACDEF 

E 

ABCDF 

C 

ABDEF 

F 

ABODE 

Two-factor interactions 

Alias 

Two-factor interactions 

Alias 

AB 

CDEF 

BF 

ACDE 

AC 

BDEF 

CD 

A BEF 

AD 

BCEF 

CE 

ABDF 

AE 

BCDF 

CF 

ABDE 

AF 

BCDE 

DE 

ABCF 

BC 

ADEF 

DF 

ABCE 

BD 

ACEF 

EF 

A BCD 

BE 

ACDF 



Three-factor interactions 


Three-f ae tor interactions 


(used as error) 

Alias 

(used as error) 

Alias 

ABC 

DEF 

ACE 

BDF 

ABD 

CEF 

ACF 

BDE 

ABE 

CDF 

ADE 

BCF 

ABF 

CDE 

ADF 

BCE 

ACD 

BEF 

AEF 

BCD 


torial effect and its alias are confounded with groups. The best choice is 
one of the aliases from the set used as error, e,g., ABC and DEF. The 
plan is easily constructed. Suppose that the set of 32 which contains 
abedef has been chosen as the half-replicate. This is divided into two 
groups of 16 such that one group (bioc.k 1) contains all treatment com- 
binations carrying an even number of the letters a, or c. The plan and 
analysis are shown in table (,).22. 

For a design in blocks of 8 units, we must confound with blocks an- 
other alias pair from the compa,risons that are used as error. Suppose 
that ABD and its alias CEF are selected. It will be recalled, (section 
6.13) that the generalized interaction of this pair with ABC and DEF is 
also confounded. By the usual rule, this interaction is the alias pair CD 
and ABEF, It is imfortunate that a two-factor interaction is con- 
founded, but further trial shows that this is unavoidable in blocks of 8 
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TABLE 6.22 A half-replicate of a 2® factorial in blocks of 16 units 


Block 1 

Block 2 



(1) 

ad 



de 

ae 



# 

af 



4 

hd 



ah 

he 


d.f. 

ac 

bf 

Blocks 

1 

he 

cd 

Main, effects 

6 

ahde 

ce 

Two-factor interactions 

15 

abdf 

4 

Error (from three-factorial 


ahef 

adef 

interactions) 

9 

aede 

hdef 


*— 

aedf 

edef 

Total 

31 

acef 

abed 



hede 

ahee 



hedf 

ahef 



hcef 

abedef 




Note that the error d.f. are reduced from 10 to 9, since 1 d.f. has been assigned to 
blocks. 

units. To construct the plan (table 6,23), divide each block of table 6.22 
into halves that contain respectively an odd and even number of the 
letters c and d. 


TABLE 6.23 A half-bepligate op a 2® factorial in blocks op 8 units 




Block 




1 

2 

3 

4 



(1) 

de 

ae 

ad 


d.f. 

4 

df 

4 

hd 

Blocks 

3 

ah 

ac 

he 

ce 

Main effects 

6 

ahef 

be 

bf 

4 

Two-factor interactions 

14 

aede 

ahde 

cd 

abce 

Error 

8 

aedf 

ahdf 

abed 

ahef 


— 

hede 

acef 

edef 

adef 

Total 

31 

bedf 

hcef 

ahedef 

hdef 




In practice, the letters should be allocated to the factors so that CD 
is the two-factor interaction that is considered least important. In the 
analysis of variance, the sum of squares for blocks, main effects, and the 
unconfounded two-factor interactions are all computed in the usual way. 

Other experiments for which plans are described in reference (6.12) are 
a half-replicate of a 2^ factorial in blocks of 8 units, a half-replicate of a 
2^ factorial in blocks of 16, a quarter-replicate of a 2^ factorial in blocks 
of 16, and a third-replicate of a 3^ in blocks of 9. 
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For the case where two-factor interactions are ignored, Plackett and 
Burman (6.13) give methods for constructing a large number of plans in 
which all main effects can be estimated. The only restriction is that the 
number of units tested is a multiple of 4. For instance, there is a plan 
for a 2^ factorial, using only 8 units (one-sixteenth of a replicate). All 
seven main effects can be obtained, though no comparisons remain for 
an estimate of error. There are further plans in 12, 16, 20, etc., units, 
giving 4, 8, 12, etc., d.f. for error. All these plans are set out for a com- 
pletely randomized design: with further confoimding a reduced block 
size could be employed. As an illustration the plan with 16 units is given 
below. To facilitate study of this plan, the treatment combinations are 
sho\vn both in our usual notation and by means of -f and — signs to 
denote the two levels of each factor. 

Every factor occurs 8 times at each level. For every pair of factors, 

each of the sign combinations, -h-f, -f — , — 4-, and ; , occurs four 

times. Thus the estimates of all seven main effects are mutually orthog- 
onal, and are calculated in the usual way. With complete randomiza- 
tion, 8 d.f. are available for the estimation of error. 


TABLE 6.24 2’ Pactoeial in 16 units (onb-bighth replicate) 






Treatment combination 



Unit 

Our 

notation 



+ and 

— notation 




a 

h 

c 

d 

e 

/ 

0 


1 

:2 '. 

(1) 

dg i 

- ' 

- 


+ 


— ■, ( 

4 

3 

cf? 

— 


4- 



4- 

4 

4 

cdf 


— 

+ 

4* 


4 


5 

hef 

-- 

+ 



4h 

4 


6 

bdefg 

1 .. — ■: 

+ 


-t- 

+ 

4 

4 

7 

hceg 


•f 




_ . 

4 

8 

hcde 


+ 


•f 

-h 



9 

m 

i + ' ' 


" , - ' 


4“ 

4',' ■ " 


10 

adeg 




4” 

■ ■ 4" 

^ 

4 

11 

acefg 





+ 

4 

4 

12 

acdef 


; ^ 

4“ 

4- 

4- 

4 

■■ V— 

13 

ahf 


+ 

— 



4 


14 

ahdfg 

+ 

+ 


4. 


4 

4 


ahcg 


+ 

4- 




4 


■f , 




4- 
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Since it is not clear how frequently all factors can be assumed to be 
additive in their effects, the utility of these designs remains to be seen. 
As Yates (6.5) lias pointed out, one case where there is additivity occurs 
when each factor represents an object to be weighed. Suppose that 
weighings are done on a chemical balance. Testing the + level of a 
factor corresponds to placing the object in the right-hand pan; testing 
the - level corresponds to placing the object in the left-hand pan. 

Thus the rows of table 6.24 provide a set of instructions for perform- 
ing 16 successive weighings on 7 objects. In the first weighing, all ob- 
jects are placed in the left-hand pan; consequently the weight that is 
read is the total weight of all 7 objects. In the second weighing objects 
d and g are placed in the right-hand pan and the others in the left-hand 
pan, and so on. This series of 16 weighings enables us to estimate the 
W'eight of each object as precisely as if all weighings had been devoted to 
that object alone. To make the analogy complete, a balance with a zero 
error would be needed, the zero error corresponding to the mean yield of 
an experiment, which can also be estimated from the 16 results. For a 
discussion and references, see Mood (6.15). 

6.3 Notes on the Plans and Statistical Analysis 

6.31 2^ Series. In plans 6.1 to 6.6, one level of a factor is denoted 
by the corresponding letter; the other level by the absence of the letter. 
With designs that are not balanced, care must be taken to allocate the 
letters a, 6, c, ... to the factors in the expeilment so that the interactions 
which are confounded are those that the experimenter has decided to 
confound. For instance, the first replication of plan 6.4 (2'^ design in 
blocks of 4) confounds the interaction ABj but no other two-factor inter- 
action. The factors labelled A and B should be those whose interaction 
is of least interest. 

The statistical analysis follows the same general method for all 2’^ de- 
signs. The total s.s. and the sum of squares among blocks are obtained 
by the usual methods. The factorial effects may be computed by one of 
the methods outlined in section 6.14. When this calculation has been 
completed; the sums of squares are added for all unconfoimded compar- 
isons. Let this total be Sr^ If all the confounding is complete, Sr is the 
treatment s.s., since the effects that are confounded appear already 
in the block s.s. For each comparison which is partially confounded, the 
sum of squares is calculated separately those replications in which the 
comparison is unconfounded. The replications to be used are clear from 
the plans, which show the factorial effects that are confounded in each 
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replication. The total of these sums of squares is added to St to give 
the treatment s.s. The error s.s. is found by subtraction. 

factorial, blocks of units. Plan 6.1, which confounds ABC com- 
pletely, may be used for every replication. For a worked example, see 
reference (6.1), p. 19. 

Alternatively, with 4, or a multiple of 4, replicates, we may confound 
partially all two-factor interactions and the three-factor interaction, the 
relative information on each comparison being 3/4. The plan and sta- 
tistical analysis are described in reference (6.1), p. 21. 

factorial, blocks of 8 imits. Plan 6.2 should be used for every replica- 
tion, ABCD being completely confounded. 

Note that the treatment s.s. contains only 14 d.f. since the single de- 
gree of freedom for ABCD is included in the blocks. 

factorial, blocks of 4 units. Plan 6.4 gives a balanced design with 6 
replications. If a smaller number of replicates is used, the replicates 
chosen depend on the interactions which the experimenter decides to 
confoimd. Since one two-factor interaction is confounded in each rep- 
licate, it will usually be preferable to confound different comparisons in 
successive replications. For instance, if AB and CD are to be con- 
founded, replicates 1 and 6 should be selected. This arrangement gives 
1/2 relative information on AB, CD and the three-factor interactions 
ACD, BCD, ABC, ABD, all other comparisons being clear of blocks. 

With a 2^ experiment, the investigator may be uncertain whether to 
confound in blocks of 8 units or in blocks of 4 units. The former design 
is recommended unless (a) the interactions which are confounded with 
blocks of 4 units are known to be small or (6) the block of 4 units is much 
more homogeneous than the block of 8 units. The latter condition may 
apply where the experimental material is extremely variable, or falls 
naturally into groups of 4 units. 

2^ factorial, blocks of 8 units. The balanced design (plan 6.5) requires 
5 replicates. When fewer than 5 replicates are used, it may be satis- 
factory to confoimd completely two three-factor interactions and one 
four-factor interaction. In this case replication 1 is repeated the re- 
quired number of times. 

If some information is desired on all treatment comparisons, replica- 
tions 1,2, • — may be used up to the proposed number. With replications 
1 and 2, for instance, the interactions ABC, ADE, ABD, BCE, BCDE, 
and ACDE are partially confoimded, each vdth relative information 1/2. 

The statistical analysis is described in reference (6.1), p. 27, for the 
case in ivhich the experiment contains only a single replication. 

2^ factorial, blocks of 16 imits. Plan 0.3, ivhich shows a. single replica- 
tion with ABCD, ABEF, and CDEF confounded, may be used for all 
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replications. The 3 interactions above are completely confounded, and 
are omitted from the treatment s.s. in the analysis of variance. 

If there is only 1 replication, an estimate of error may be obtained 
from the unconfonnded four-, five-, and six-factor interactions. These 
comparisons supply 22 di., of which three are confounded with blocks, 
leaving 19 for the estimate of error. The separation of the 63 d.f . is then 
as follows: 

di. 


Blocks 3 

Main effects 6 

Two-factor interactions 15 

Three-factor interactions 20 

Error (from high-order interactions) 19 


factorial j blocks of 8 units. In plan 6.6, the size of block is reduced 
from 64 to 8 units without confounding any two-factor interactions. 
Four of the 20 three-factor interactions and 3 of the 15 four-factor inter- 
actions are confounded in each replication. The balanced design requires 
10 replications, i.e,, 640 experimental units, and necessitates a large ex- 
periment. 

With a single replication, the 63 di. in the analysis of variance sub- 
divide as follows: 

■■ .d.f. 


Blocks 7 

Main effects 6 

Two-factor interactions 15 

Three-factor interactions 16 

Error (from high-order interactions) 19 


The simplest procedure when there is more than one replication is to 
confound completely a single set of 7 factorial effects by the repeated use 
of replication 1. For partial confounding, the successive replicates in 
plan 6.6 may be taken. In the first 4 replicates no three-factor inter- 
action is confounded more than once. 

6.32 3” Series. From plan 6.7 onwards, the levels or variations of 
each factor are denoted by the figures 0, 1, 2, • ' *, while the factors are 
read from left to right. The treatment 102 implies the middle level of 
the first factor, the lowest level of the second factor, and the highest 
level of the third factor. In an experiment the numbers 0, 1, 2, * — may 
be assigned to the levels or variations of a factor according to any con- 
venient scheme. 

The testing of three instead of two variations of each factor makes 
possible a more thorough evaluation of the effects of the factors. In 
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particular, the 3^ factorial experiment has been much used in investiga- 
tions of the responses to fertilizers, since the effects of each of 3 different 
plant nutrients may be explored at 3 levels of application, which provide 
some information on the shape of the response-curve, 

factorial, blocks of 9 units. In plan 6.7, two of the 8 d.f . for ilBC are 
completely confounded in each replication, so that the balanced design 
requires 4 replicates and provides 3/4 lelative information on 
Unless the three-factor interactions are known to be negligible, in which 
case replication 1 may be used for all replicates of the experiment, it is 
preferable to select the required number of replications in succession 
from the plan. For example, an experiment with 3 replicates gives 2/3 
relative information on 6 of the 8 interaction d.f., and full replication on 
the remaining 2 d.f. As usual, the rule in the analysis of variance is to 
estimate confounded comparisons from those replications in which they 
are hot confounded. An example of the analysis with 4 replicates has 
already been given (section 6.17). 

For an example of the analysis with a single replicate, see reference 
(6.1), p. 53. If the three-factor interactions are used as error, the separa- 
tion of the degrees of freedom is as shown on the left in table 6.25. 


TABLE 6.25 Subdivision or thk degeebs op febedom in a 3'"^ experiment 

WITH SINGLE EEPLICATION 


I, Three-factor interactions 
used as error 

d.f. 


Blocks 2 

Main effects 6 

Two-factor interactions 12 
Error 6 

Total 26 


II. For experiments where all 
effects are approximately 
linear 

d.f. 


Blocks 2 

Main effects 6 

AB: linear X linear 1 

AOi linear X linear 1 

BC*. hnear X linear 1 

Error 15 

Total 26 


The number of error d.f., 6, is rather small. There is an alternative 
scheme for experiments where the effects of the factors are linear, or 
nearly so. Under these circiunstances, the only components of the two- 
factor interactions that are likely to be substantial are the interactions of 
the linear responses to the factors. Hence, nine of the 12 d.f. for two- 
factor interactions are added to the material used for an estimate of error, 
as shown in table 6.25, II. If in addition the linear components of 
the two-factor interaction are large, a further variant is to remove from 
the error the ABC\ linear X linear X linear term. This leaveB‘ i4 di. 
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for error. The calculation of these components should be fairly simple 
with the help of the example worked in section 6.17. 

3^ factorial, blocks of 9 imits. With 4 factors each at 3 levels (plan 6.8) 
the block size is reduced from 81 to 9 units by confoimding three-factor 
interactiouvS. There are 4 three-factor interactions: ABC, ABD, ACD, 
and BCD, each with 8 di . Two components from each set of 8 are con- 
founded ill any replication. If the experiment has more than 1 replicate, 
partial confounding is generally preferable, the required number of rep- 
licates being selected from the balanced set. The balanced design, with 
4 replicatcKS, gives threefold replication on all three-factor interactions. 

Statistical analysis with a single replicaiion. Ajx analysis may be cal- 
culated without difSculty from the following partition of the degrees of 
freedom. 

di. 

Blocks 8 

Main effects 8 

Two-factor interactions 24 

Error (from three- and four-factor interactions) 40 

All sums of squares are calculated in the usual way. For the main effects 
and two-factor interactions, six 3X3 tables ai'e required. The error, 
which is found by subtraction, contains 24 d.f. representing imcon- 
founded three-factor interactions and the 16 di. for the four-factor inter- 
actions. 

This analysis gives no test of the three-factor interactions. If a test 
of ABC, for example, is wanted, the first step is to calculate in the usual 
way the sum of squares for the 8 di. From this we must subtract the 
sum of squares for the 2 di. that are confounded with blocks. 

In plan 6.8, replication I, the 2 d.f. are denoted by ABC IV, Avhere the 
numeral IV refers to replication IV of plan 6.7. If we ignore D, it will 
be found that blocks 1, 6, and 8 contain the same set of 9 treatments as 
block IVa in plan 6,7. Calculate the total yield of these 3 blocks (27 
imits). Similarly, we calculate the total yield of blocks 2, 4, and 9, which 
correspond to block IV6 in plan 6.7, and of blocks 3, 5, and 7, which cor- 
respond to block IVc. The smn of squares of deviations of the 3 totals, 
divided by 27, is subtracted from the sum of squares for the 8 d.f. The 
resulting 6 di, are removed from the error, which now has 34 d.f. 

The reader may verify that the 2 confoimded degrees of freedom for 
BCD (i.e., BCD III) are obtained from the totals of blocks 1, 2, 3, blocks 
4, 6, 6, and blocks 7, 8, 9. In this case A is ignored when plan 6,8 is com- 
pared with plan 6.7. 

. Statistical analysis with more than one replication. If all three-factor 
interactions are presumed to be negligible, they may be combined with 


6.33 


199 


MIXED SERIES 

the error. The sums of squares for blocks, main effects, and tAVo-factor 
interactions are found in the usual way while the remainder constitutes 
the error. 

All 32 components of the three-factor interactions may, however, be 
calculated and tested. Partially confounded comparisons are taken only 
from replicates in which they are unconfounded. For example, if there 
are two replications of plan 6.8, all the data are used in the calculation of 
components A jBG I and ABC II, which are uneonfounded: For /IBC III 
we use only the data from the first replicate of the experiment, and for 
ABC IV only the data from the second replicate. 

6.33 Mixed Series. S X facioiHal, blocks of 6 units . Only the bal- 
anced plan (6.9) should be used; this may be repeated if extra replica- 
tions are wanted. Thus the number of replicates must be 3, or some 
multiple of 3. Although the BC interaction is partially confounded, the 
relative information is high (8/9). 

The statistical analysis is described, with a nmnerical example, in 
reference (6.1), p. 58. 

3^ X ^ factorial, blocks of 6 units. In plan 6.11, AB and ABC are 
partially confounded. Either 2 or 4 replicates may be used, though 4 
are required for complete balance. A numerical example for 4 replicates 
was given in section 6.19. 

With 2 replicates (say I and II in plan 6.11) the I and R components 
are partially confounded, while the J and S components are uncon- 
founded. The I and R totals are adjusted by the same formulae as is 
table 6.16, (iii) and (iv). The J and S totals need no adjustment. The 
intei'action s.s. in the analysis of variance are computed as follows. 

ABQ) = ^[(27i0" + + ( 2 / 3 ')"] - TWsWi' + 2 / 2 ' -f 2/3f 

AB{J) = + h^] - MJi +J 2 + Jzf 

ABC{R) = A[(2KiO' + + (2/i30'] 

- + 2 E 2 ' + 2«3f 

ABC{S) — — -^[iS'i + S 2 + fSa]® 

The AB two-way table and the table of individual treatment means 
are adjusted in the same rvay as with four replicates. The quantities 21', 
2R' are divided by 18 and 6, respectively, to obtain i' and r'. For i and 
r, we divide I and R each by 12. After taking deviations from the mean 
in each case, we calculate Si and Sr. Thereafter the adjustnaent proceeds 
exactly as with 4 replicates, being somewhat simpler since J and S com- 
ponents do not enter. 
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3 X factorial, blocks of 8 units. In plan 6.10 the interactions BC, 
BD, and CD, between pairs of factors that occur at 2 levels, are partially 
confounded with 8/9 relative information, while 5/9 relative information 
is retained on the interactions ABC, ABD, and ACD. Only the balanced 

design which requires 3 replicates is recommended. 

The following computations lead to the siun of squares due to partially 
confoimded effects. Let 

gi = Ift “h Ifc — 1(3 IdJ “ la Ifa "h Ic IdJ 

gi" = lo — ~ lo + 

where is the total of block etc. Similar definitions hold for g 2 ,- ' ■ , 
Ps" in the other replications. Then find 

3Q = 3[C'D] + gi + gsA-gs) ^Q' = 3[BL>] + gi + gs + Qs'] 

ZQ" = Z[BC\ + gi' + gz" + g^!' 

ZBq — 3[C'Da] — gi + S'2 + ff3) 32ili = 3[C'Da,] + + 1/2 ~ 

3E2 = 3[C'i)aJ + Pi — P2 + fi'S 

3ffo' = 3[J5DoJ + gi — g^ + gz', ZEx = Z[BDa,^ gi Ar gz' A- 9z'’, 
ZRz' = Z[BDa^ A gi A- 82 ' — gs' 

ZBo''=ZlBCa,]+gi”+g2'’-g3"-, ZRi"^3[BCa,] + gi"-g 2 ''+gz''; 

ZR 2 " = 3[SCJ - gi" + fe" + gz" 

The sum of squares for the two-factor interactions are as follows. 


CD 


576 


n2 


BD = 


(3Q0 

576 


//\2 


BC = 


(3Q^0 

576 


For ACD, divide the sum of squares of deviations of the quantities 3J?0) 
ZRi, ZRz by 120, with similar rules for ABD and ABC. 

The CD, BD, and BC two-way tables must be adjusted for block ef- 
fects. For the means per unit in the BC table, the adjustment is 


ZQ" 

192 


IBC] 

72 


The adjustment is added (algebraically) when b and c are both present 
or both absent; when bore alone is present the adjustment is subtracted. 

4^ factorial, blocks of 4 units. In plan 6.12 each replication confounds 
completely a set of three of the 9 d.f. for AB. Since the relative informa- 
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tion on AB is 2/3, this plan gives less precise estimates of the interactions 
than randomized blocks unless the reduction in block size brings a sub- 
stantial decrease in the error m.s. The balanced design, which requires a 
multiple of 3 replicates, is preferable. 

With 3 replicates the analysis of variance is as follows* 


Replicates 

d.f. 

2 

Blocks within replicates 

9 

Main effects 

6 

Two-factor interactions 

9 

Error 

21 

Total 

47 


To compute the sum of squares for AB, let B^n be the total of the nth 
block in the ??^th replicate, and let T^n be the total of all treatments that 
appear in this block (this will be a total over 12 units). Let = 
Tmn — Bmuf SO that there are 12 quantities Then 
AB - + . . . + P3,2) _ + Pjj2 ^ 

where G is the grand total and the are replication totals. 

The estimated block mean (adjusted for treatment effects) is 

3-Sm7l ' Tmn 


To present a two-way table, the adjusted mean of treatment (00) is, for 
example, 

(00)' = (00) '0-(6i6 + biia + bin ft) 

since this treatment occurs in blocks 16, lla, and III6. 

4 X factorial, blocks of 8 units. In plan 6.13, one of the three com- 
ponents of ABC is confounded in each replicate. In other words, the dif- 
ference between the treatments in block la and those in block 16 is one 
component of ABC. The plan gives 2/3 relative information on ABC. 
Any number of replicates may be used, though three are needed for 
balanced confounding. 

To obtain the sum of squares for ABC, calculate the total for each 
component separately, taking it only from those replicates in which it is 
not confounded. The divisor for the square of this total is obtained by 
the usual rules. Thus with 2 replicates the first component is taken from 
replicate II and has divisor 16, the second component is taken from rep- 
licate I and has divisor 16, while the third component comes from both 
replicates and has divisor 32. 
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If there are f replicates, the adjusted block mean is 

7'B^n ^nin 

where is the block total and that of all treatments appearing in 
the block. With 2 replicates, the adjusted mean of treatment 111 is 

( 111 )' - ( 111 ) 

since this treatment is found in blocks la and 11??. 

4- X s X 2 factorial, blocks of 12 units. The 3 d.f. for the factor A at 
4 levels may be divided into the following components. 


= Us 0^2 — 


«0 


«= ag - a2 - ai + Oo 
^1''' = as — a2 + ai — ao 

In the first 3 replicates of plan 6.14, the interactions yl'C and A'BC 
are partially confounded, with 8/9 and 5/9 relative information, re- 
spectively, the interactions of A" and it '"being unconfounded ; A "C and 
A"BC are partially confounded in replicates 4 to 6 and A'^^C and 
in replicates 7 to 9. Although nine replicates are required for complete 
balance, the design may be used with three or six replicates. In these 
cases the last three or the last six replications should be taken from plan 
6.14. The AC interaction suffers only a trivial loss of replication from 
the confounding. The statistical analysis is described in reference (6.6). 

6*34 Missing Values. When factorial experiments are arranged in 
randomized blocks or latin squares with no confounding, the formulae 
previously given (sections 4.25, 4.36) are used for the estimation of miss- 
ing values. With confounding, the formulae are changed, becoming in 
general more complicated. The different cases are discussed below. 

i. Complete mifoimding of the 2^, 2^, 2^, 2^, 8^ factorials. When 

the effects that are confounded are unimportant, we recommended that 
the first replication of the plan be used for all replications of the experi- 
ment, with complete confounding of these effects. In this case the esti- 
mation formula is simple. Let B be the total yield of all other units in 
the same block as the missing unit, and T be the total yield of all other 
units which have the same treatment as the missing imit. Every replica- 
tion contains a block which has the same set of treatments as the block 


6.34 


MISSING VALUES 


203 


with the missing value. Let B' be the total yield of all such blocks (in- 
cluding the block with the missing value). 

The value for the missing unit is estimated by the formula: 

_ rB -H hT ~ B' 

where k is the niunber of units per block, and r is the number of replica- 
tions. 

It will be realized that this is the standard formula for rand ( 3 mi zed 
blocks (section 4.25), applied only to those blocks which contain the 
treatment wdth the missing value. 

ii. Partial confounding of the 2^, 2^^ 2^, 4^^ 4 X 2^ factorials. 

The formula given later is valid when a difierent replication of the plan 
is used for each replication of the experiment. A further essential con- 
dition is that no treatment comparison be confounded in more than one 
replication. This condition is satisfied in all plans for the designs above, 
except possibly plan 6.4 (2^ in blocks of 4) and plan 6.6 (2^ in blocks of 
8). For instance, if the first 2 replications of plan 6.6 are used, no inter- 
action is confounded more than once; but if the first 3 replications ai'c 
used, BCD is confounded twice arid the formula does not apply. 

The formula requires some preliminary calculations, most of which 
can be used in the subsequent analysis of variance. In addition to B and 
T, as defined in case i, we require first the total R of all other units in the 
same replicate as the missing unit and the grand total G of all other units 
in the experiment. In the other replicates, calculate the total Bb of all 
blocks which contain the treatment with the missing unit. 

Next form the table of treatment totals. From this table compute 
the sum U of the observations for all other treatments which appear in 
the block which has the missing unit. In the other i^eplications, calculate 
the corresponding sum for each block which contains the treatment with 
the missing unit. Let the total of these (r — 1) sums be V, where r is 
the number of replicates. Note that in the calculation of U and V the 
treatment with the missing value is not included. 

The missing yield is estimated by the equation 


kt(r - 1)T + Hr - 1)B + kG + tV - krR - t{r - l)U - trSb 

_ ^ 1 ) ^ _ /,) ] 


The table on p. 204 which shows the number of units whose observa- 
tions are added to obtain the various totals T, J5, * • -, may be helpful in 
checking that the totals have been correctly identified. 

Consider f7, for example. There are (ifc — 1) other treatments in the 
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block with the missing value, and each treatment total is the sum of r 
observations. Consequently U is a total of T{k - 1) individual observa- 
tions. 


Total 

T 

B 

G 

Y 

E 

V 

Si 

Number of 
units 

(r - 1 ) 

i 

(k-l) 

(tr - 1 ) 

r{r ~ m - 1 ) 

a ~ 1 ) 

I 

fik — 1 ) 

k(r - 1 ) 


hi. Partial confounding of the 3 X 3^^ 3 X 3^ y 3^ X 4 X 5 X 3 fac- 
torials. For these designs the general method given by Yates (6.16) 
should be followed. 

iv. Single replication of the 3^, 3^, 3^, 3^ factorials. A missing value is 
estimated by minimizing the sum of squares for the interactions that are 
used as error. 

The estimation of missing values in cases hi and iv requires a knowl- 
edge of the algebra of the analysis of variance. The expressions for the 
missing value are usually fairly simple in case iv but are complicated in 
case hi. 

^Vhen the estimated value has been substituted, the analysis of vari- 
ance is calculated by the appropriate method for the design. As usual, 
the error d.f. are reduced by one for each estimated value. Variance- 
ratio tests of the treatment effects are slightly disturbed. Fori-tests, an 
approximate rule is to decrease the number of replicates ascribed to any 
mean by 134 each estimated value that the mean contains. Suppose, 
for instance, that (abc) is missing in a 2^ experiment with 3 replications. 
The mean of the units which receive a is taken over 12 observations, one 
of which is an estimated value. By the rule, the standard error of this 
mean is computed as s/a/IoJ instead of s/'\/l2. Consequently the 
standard error of the mean response to A is taken as 


instead of 


6.36 Estimation of the Gain in Precision from Confounding. As men- 
tioned previously, the advisability of confounding depends on the 
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amount by which the experimental error is decreased when the block size i 

is reduced. It is therefore worth while, especially where confounding has 
not been previously used, to estimate the gain in precision which has been 
obtained in each experiment. 

The technique is essentially the same for all types of design. From the 
results of the analysis of variance of a confounded experiment, an esti- 
mate Er is made of the experimental error which woiJd have been 
present if the experiment had been laid out in randomized complete 
blocks. 

Single replication. Let Ejy be the mean square for blocks, and Ee the 
mean square for error. If there is only 1 replication in the experiment, i 

the estimate iJ'r is : ^ , 

( 6 . 1 ) 

^6 + Ue I 

where Ub == number of degrees of freedom for blocks. I 

lie = total number of degrees of freedom minm | 

For example, with a 2^ experiment in blocks of 8 units (single replication) i 

Ub = 7, ■■ Ue =, 63 — 7 — . 56 ■, | 

The estimate Er is directly comparable with the estimated error Ee* As I 

shown in section 2.31, the comparison may be expressed in terms of the t 

relative amounts of replication required to obtain equal accuracy with 
the tw^o types of layout, [ 

The estimate Er is subject to the assumption that the interactions ^ 

which are confounded with blocks are negligible. Since confounding is 
complete, no method of estimation avoids this assumption. If the con- ;; 

founded interactions are not negligible, the calculation overestimates the I 

gain from confounding. ii 

Moi^e than one replication. If the confounding is complete, formula '■ 

6.1 holds for E^ subject again to the assumption that the confounded 
comparisons are negligible. In this case ] 

Eh — mean square for blocks within replicaUons. [ 

Ee = error mean square. I 

rib — number of degrees of freedom for blocks within replications. : 

7ie = total number of degrees of freedom minus number of degrees of 

freedom for replications minus Ub* \ 

Thus with 2 replicates of a 2^ design in blocks of 8 units, the same inter- ■ 

actions being confounded in both replicates, we have 
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With partial confounding, the same formula may be used if all con- 
founded comparisons are assumed to be small. Evidence on the validity 
of this assiunption is provided by the experimental results, since partially 
confounded comparisons can be tested for significance. 

It is possible to avoid the assumption that confounded comparisons are 
negligible, though some e.xtra computations are necessary. We first 
adjust the mean square among blocks within replicates so as to remove 
treatment effects. The following identity holds for the sums of sqxiares. 

Blocks within replicates (adj.) + treatments (rmadj.) = 

blocks within replicates (unadj.) + treatments (adj.) 

The two quantities on the right of the equation appear in the analysis 
of variance of the experimental results. Consequently the adjusted 
blocks S.S. is found from this equation by calculating the unadjusted 
treatments s.s. From the adjusted blocks s.s. we obtain, as shown 
below, a new estimate Ef for use in formula (6.1). 

These procedures may be illustrated from the results of the 3X3X2 
experiment in 4 replicates (section 6.19), in which .,4B and ABC sxe 
partially confounded. From table 6.15 the following summar''^ of the 
analysis of variance is obtained. 



di. 

s.s. 

m.s. 

Replications 

3 

3,837 

1,279 

Blocks within replications 

S 

2,836 . 

354 

Treatments 

17 

7,802 

450 

Error 

43 

8,909 

207 

Total 

71 

23,384 



If AB and ABC are assumed negligible, the estimate Er is found by 
formula (6.1) for the case where there is more than one replication. 

8 X 354 -f 60 X 207 

Er = 224 

68 

In order to avoid the assumption, we calculate the unadjusted treat- 
ments S.S., which is found to be 8328. Since the adjusted treatments 
s.s. is 7802, we have 

Blocks s.s. (adj.) = 2836 -H 7802 - 8328 = 2310 

corre.sponding to a mean square E^^ of 289. 

An unbiased estimate Ef of the variance between blocks within rep- 
licates is 


AEi, - Ee 4 X 289 - 207 
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This value is inserted in formula (6.1) to give E/: 

8 X 316 + 60 X 207 

= ^ == 220 ' 

68 

The estimated gain in efficiency over randomized blocks is 8% when 

AB and ABC are assumed negligible and 6% without this assumption. 

The two results agree closely, as is usually the case unless the confounded 

interactions are large. 

The method above applies to all designs with partial confounding. 

For the designs discussed in this chapter, the formula for Ei/ in terms of 
and Ee is 

tEiij — Eq 

r — 1 ■' 

where r is the number of replicates. 
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• PLAKB' . ' 

Plan 6.4 Balanced group of sets for 2^ factorial, blocks of 4 units 

Two-factor interactions are confounded in 1 replication and three-factor 
interactions are confounded in 3 replications. The columns are the blocks. 

Rep. I, AB, confounded Rep, 11, AC, ABD, BCD 
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Plan 6.6 Balanced group of sets for 2^ factorial, blocks of 8 units 

Three- and four-factor interactions are confounded in 1 replication. 
Rep. I, ABC, ADE, BODE confounded Rep. II, ABD, BCE, ACDE 
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Plan 6 M (Continued) 

Balanced group of sets for 2® factorial, blocks of 8 units 


Rep. Ill, ACE, BCD, ABDE 
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Plan 6S Balanced group of sets for 2® factorial, blocks of 8 units 

All three- and four-factor interactions are confounded in 2 replications. 
Eep. I, ABC, CDE, ADF, BEF, ABDE, BCDF, ACEF confounded 
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liep. II, ABD, DBF, BCF, ACE, ABEF, ACDF, BCDE 
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Plan 6.6 (Contpmied) 


Balanced group of sets for 2® factorial, blocks of 8 units 
R,ep. Ill, ABE, BDF, ACD, CEF, ADEP, BCDE, ABCF 
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Eep. VI, ABC, BDE, ADF, CEF, A.CDE, BCDF, ABEF 
Interchange B and C in replication I 
Rep. VII, ABF, DEF, BCD, ACE, ABDE, ACDF, BCEF 
Interchange F and D in replication II 
Rep, VIII, ABE, BDF, CDE, ACF, ADEF, ABCD, BCEF 
Interchange A and E in replication III 
Rep, IX, ABD, CDF, AEF, BCE, ABCF, BDEF, ACDE 
Interchange F and D in replication IV 
Rep. X, AEF, BDE, ACD, BCF, ABDF, CDEF, ABCE 
Interchange E and C in replication V 
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Plan 6.7 Balanced group of sets for 3^ factorial, blocks of 9 units 

JiBC confounded 
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Plan 6.8 Balanced group of sots for factorial, blocks of 9 units 
Tliree-factor interactions confounded 
Rep. I, ABC IV, ABD I, ACD II, BCD III confounded 
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Plan 6.8 (Continued) 

Balanced group of sets for 3^ factorial, blocks of 9 units 


Rep. II, ABC 111, AGD IV, ABD 11, BCD I 
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Plan 6 JO Balanced group of sets for 3 X 2^ factorial, blocks of 6 units 
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Plan Balanced group of sets for 4- factorial, blocks of 4 units 
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Plan 6JS Balai.«ed group of sots for 4 X 2^ factorial, blocks of 8 uuita 
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Balanced group of sets for 4 X 3 X 2 factorial, blocks of 12 units 
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Plmi 6J4 {Continued) 

Balanced group of sets for 4X3X2 factorial, blocks of 12 units 
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CHAPTER 7 


FACTORIAL EXPERIMENTS WITH MAIN EFFECTS 
CONFOUNDED: SPLIT-PLOT DESIGNS 


7.1 The Simple Split-plot Design 

7.11 Description. In field experiments an extra factor is sometimes 
introduced into an experiment by dividing each ph)t into a number of 
parts. For example, if the experiment is planned originally to test a 
factor A with five levels, the division of each plot into halves permits the 
inclusion of an extra factor B at two levels. Within each plot the two 
levels of B are allotted at random to the two sub-plots. If the whole 
plots are in a randomized block design, the plan (after randomization) 
might appear as shown in table 7.1. It is worth noting the difference be- 

TABLE 7.1 Example op a rplit-plot design 
Eep. 1 Rep. 2 Rei>, 3 

az ai % f<'o ai 04 (12 d-i css crn <^2 (I4 


h 


h 




h 


60 


60 


h 


tween this arrangement and ordinary randomized blocks. In the latter 
the ten treatment combinations are assigned to the ten sub-plots in a 
replication completely at random. Here we have a more orderly assign- 
ment in which tlie two treatment combinations that have any given level 
of A always appear in the same whole-plot. 

This type of arrangement is common in industrial experimentation, 
although the connection may not at first be realized. Frequently, one 
series of treatments requires rather a large bulk of experimental ma- 
terial, while another series can be compared with much smaller amounts. 
For instance, the comparison of different types of furnace for the prep- 
aration of an alloy would use much greater amounts of alloy than the 
comparison of different types of mould into which the alloy might be 
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poured. In an experiment in which both factors are to be tested, the 
natural procedure is to take the material prepared in any furnace, and 
pour some of it into each mould. That is, material prepared in one 
furnace at one time provides a complete replication for the comparisons 
among moulds, just as the plot containing any level of A provides a com- 
plete replication of the factor B. Another instance is the comparison of 
different machines for milking dairy cows. Each machine would neces- 
sitate rather substantial amounts of milk, A\d:iereas other compariBonSj 
for example on the best method of pasteurizing or of cooling, could be 
conducted with a much smaller amount of milk per treatment. The 
produce from any machine could be subdivided for these subsequent 
tests. 

We may describe this design in another way that brings out more 
clearly its relation to the confounded factorial designs discussed in chap- 
ter 6. If the sub-units are I'egarded as the experimental units, it is seen 
that the treatments ao, at , . . . , are applied to groups or blocks of two 
units. Differences among these blocks are confounded with dilferences 
among the levels of A; i.e., the main effects of A are confoimded. Ac- 
cordingly, the split-plot design is sometimes considered as one in which 
certain rnam effects are confounded, as contrasted with the designs in 
chapter 6, where the confounding is restricted to interactions. 

7.12 Nature of tlie Experimental Error, In the statistical anah^^sis, 
account must be taken of the fact that the observations from different 
sub-units in the same imit may be correlated. In field experiments this 
coiTelation is just a reflection of the fact that neighboring pieces of land 
tend to be similar in fertility and in other agronomic properties. In in- 
dustrial experiments the same correlation may be present, because any 
factor that affects the whole batch of alloy prepared in one furnace at 
one time will tend to create similarity among smaller batches poured 
from it. 

The mathematical analysis used to examine the effects of this correla- 
tion will be illustrated for the experiment in table 7.1. Let i refer to the 
replication, j to the level of A, and k to tliat of B: for siib-imits in the 
same unit, the i and j subscripts will be the same. The assumption is 
made that a correlation p exists between the experimental error>s etjjc and 
eiju for any two sub-units in the same unit. Sub-units in different imits 
are assumed to be uncorrelated. Mathematically, we have 

We now consider the effects of this model on the most important treat-r, 
ment comparisons. The main effects of A are calculated entirely from 
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unit totals or means. With two sub-units per unit, the error variance of 
a unit total is 

E(eiji + eij2)^ == + 2E(eijieij2) = (t- + or^ + 2pcr^ 

^2a^(l+p) 

The factor 2 may be regarded as representing the effect of adding over 
2 sub-units. Consequently, for the main effects of the appropriate 
error variance per sub-unit is cr^(l + p). If there are ^ sub-units per 
unit, the corresponding variance works out at a^[l + (/3 — l)p]. 

The main effects of B, on the other hand, are derived from the dif- 
ference between the two sub-units in a unit. For this we have 

Eieiji - eij2f ^ 2(r^(l - p) 

Thus the effective variance per sub-unit applicable to the main effec^ts 
of B is 0-^(1 — p). This expression remains unchanged when there are 
^ sub-units per unit. This variance also applies to any component of the 
AB interaction, since such components involve comparisons of {hi — bo) 
at different levels of A. 

For other treatment comparisons, the basic error variance may be 
different from either of the two expressions above. Consider, for in- 
stance, (05360 ^i6o)j a comparison of with ai at the zero level of i?. 
In any replication the 2 sub-units involved are in different units, and are 
therefore independent. The variance of their difference is therefore 
2cr^, and the basic variance per sub-unit is However, the appropriate 
variance for all other comparisons of this type can be derived from the 
basic variances cr^[l + — l)p] and 0*^(1 — p). As will be seen, the 

analysis of variance gives unbiased estimates of these two variances, 
and from these, unbiased estimates of any particular variance can be 
obtained. 

In practice, p is nearly always positive. The result is that the main 
effects of A (the factor applied to the units) are less precisely estimated 
than those of B or than the AB interaction. 

The analysis of variance is fairly easy. For the example in, table 7.1 w^e 
first compute the 15 plot totals. Their sum of squares of deviations is 
partitioned in the usual way into 2 d.f. for replications, 4 for the main 
effects of Aj and 8 for the experimental error applicable to a whole-plot. 
The mean square for the latter is an unbiased estimate of 0-^(1 -j- p). 
All computations are divided by 2 to convert them to a sub-unit basis. 

Next take the difference (61 — bo) on every unit. The 15 differences 
provide 1 di. which represents the main effect of J?, 4 di. which represent 
the AB interactions, and the remaining 10 d.f. whose mean square gives 
an unbiased estimate of the sub-plot error variance cr^(l — p). All sums 
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of squares are again divided by 2, The complete separation of degrees 
of freedom is shown in table 7.2. 


TABLE 7.2 Analysis of variance foe the split-plot expioriment in 


TABLE 7.1 

d.f. 

Whole plots 


Replications 

2 


4 

Whole-plot erroi’ 

■ 8 ■ 

Total 

14 ■■ ■ 

Sub-plots 


B 

1 . 

' AB 

4 

Sul)-plot; error 

10 

Grand total 

29 

The point to be noted is that the whole-plot error is computed en- 

tirely from plot totals, and the sub-plot error 

entirely from the differences 

between sub-plots in the same plot. It will be seen that the grand total 
of the degrees of freedom is 29. As might be expected, the corresponding 


sum of squares is the siun of squai^es of deviations of the 30 observations 
from their mean. A computation of this quantity provides a check on all 
sums of squares. In practice, particularly with more than 2 sub-unit 
treatments, we usually find the sub-plot error by subtraction. 

7.13 Comparison with Randomized Blocks. The experiment in table 
7.1 might be arranged in ordinary randomized blocks, with 3 blocks of 
10 treatments each. In a consideration of the relative merits of tlie two 
arrangomentB, the following points are i*elevant. 

1. With the split-plot design, usually the B and the AB effects are 
estimated more precisely than the A effects. Moreover, the number of 
degrees of freedom available for the experimental error m.s. is smaller 
for whole-unit comparisons than for sub-unit comparisons. 

2. It can be shown that the average experimental error over all treat- 
ment comparisons is the same for both designs. Consequently there is no 
net gain in precision resulting from the use of the split-plot design ; the 
increased precision on B and AB is obtained by the sacrifice of precision 
on A. For tests of significance or the construction of confidence limits 
the randomized block design holds a slight advantage on the average 
since it provides more degrees of freedom for the estimate of the single 
error variance that it requires. For instance, the experiment cited has 
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8 di. for the whole-unit error and 10 di. for the sub-unit error, whereas 

randomized blocks would provide 18 d.f. 

3. As we have indicated, the chief practical advantage of the split- 
plot arrangement is that it enables factors that require relatively large 
amounts of material and factors that requii-e only small amounts to be 
combined in the same experiment. If the experiment is planned to in- 
vestigate the first type of factor, so that large amounts of ma.terial are 
going to be used anyway, factors of the second type can often be in- 
cluded at very little extra cost, and some additional inf ormation obtained 
very cheaply. 

To summarize, the split-plot design is advantageous if the B and AB 
effects are of greater interest than the A effects, or if the A effects cannot 
be tested on small amounts of material. 

Two disadvantages have been mentioned by experimenters. Some- 
times the whole-unit error is much larger than the sub-unit error. It 
may happen that the effects of A, though large and exciting, are not 
significant, whereas those of B, which are too vsmall to be of practical 
interest, are statistically significant. The experimenter tends to be 
uncomfortable in reporting results of this type. Secondly, the fact that 
different treatment comparisons have different basic error variances 
makes the analysis more complex than Avith randomized blocks, espe- 
cially if some muisual type of comparison is being made. 

When the number of replications and the experimental conditions are 
suitable, the whole units may be arranged in a latin square. A split-plot 
latin square, which eliminates the error variation arising from two types 
of grouping, may be preferable to randomized blocks. Summarizing 22 
field experiments in latin squares where the plots were split into halves, 
Yates (7.1) found a substantial net increase in precision over ran- 
domized blocks. The superiority of the latin square was so pronounced 
that even the whole-plot comparisons would have been less precisely de- 
termined in randomized block designs. Factorial combinations that lend 
themselves to the use of split-plot latin squares are the 5 X 2, 5 X 3, 
5 X 4, f3 X 2, 6 X 3, 7 X 2, 7 X 3, 8 X 2. 

7.14 Randomization. The treatments applied to the units are ran- 
domized according to the instructions for the design (e.g. randomized 
blocks, latin square, etc.) in which the units are arranged. The treat- 
ments applied to the sub-icnits are allotted at random vathin each unit. 
A separate randomization is carried out for each unit. 

7.15 Statistical Analysis. If the factor A (applied to the units) con- 
tains a levels, and the factor B (applied to the sub-units) contains 
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STANDARD ERRORS 


m 


levelSj the subdivision of degrees of freedom in the analysis of variance is 
shown in table 7.3* The details of the analysis are illustrated in section 
7.17. 


TABLE 7.3 Paetition of degrees of feee»om for a design 

Units arranged in randomized blocks Units arranged in a latiri square 


(r replicates) 


(r - a replicates) 


Units 

d.f. 

Units 

di. 

Blocks 

(r-l) 

Rows 

(a - 1) 

A 

ia - 1) 

Columns 

(a — 1) 

Error (a) 

(« - l)(r - 1) 

A ' 

(a - 1) 



PhTor (a) 

(a - l)(a - 2) 

Total 

(ra — 1) 





Total 

(a° — 1) 

Bub-units 


Sub-units 


B 

(H - 1) 

B 

- 1) 

AB 

(a - m - 1) 

AB 

{a - Dip - 1) 

Error (6) 

a{r - l)(/3 - 1) 

Error (h) 

a{a — 1)(0 - 1) 

Total 

m(j(3 — 1) 

Total 

a^ip-1) 


7.16 Standard Errors, Let Ea and be the error mean squares for 
eiTor (a) and error (6), respectively, on a sub-unit basis. For the treat- 
ment means, also expressed on a sub-unit basis, the standard errors 
shown in table 7.4 apply. The final comparison in table 7.4 contains 

TABLE 7.4 Standard errors for the split-plot design 
Treatment comparison s.e. 

DiilGrcnce between two A means: e.g., [(ai) — (ao)] '\/2Ea/r§ 

DiiYerence betwe(3n two B means: e.g., [(/q) — (bo)] V^2Eh/ra 

Diffcrcmee between two B means at the same level of A : 

o.g., [{(hbi) — (aiho)] V2Eb/r 

Difference between two A means at the same level of B: 
e.g., [(aii>i) - (aohi)] Vm^lEV+E^]/^ 

both the main elfect of A and the AB interaction; eonseqx-iently the ap- 
propriate error is a weighted mean of Ea and Ei,. This error also applies 
to the difference between two A means which have different levels of B. 
in such cases the ratio of the treatment difference to its standard error 
does not follow the Rlistribxition. For practical purposes the uixprox- 
imate rule of section 4.14 may be used, though this method gives slightly 
too few significant results* Let 4,5 h be the significance levels of t corre- 
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spending to the degrees of freedom in Ea and £/?,; respectively. The 
significance level of t is taken as 

(/3 — l)Mbth + Eata 
{P — + Eu 

For an application, see the next section. 

7.17 Numerical Example. In an experiment on the preparation of 
chocolate cakes, conducted at Iowa State College (7.2), 3 recipes for 
preparing the batter were compai’ed. Recipes I and II differed in that 
the chocolate was added at 40''C. and 60''C., respectively, while recipe 
III contained extra sugar. In addition, 6 different baking temperatures 
were tested: these ranged in 10'' steps from 175° to 225°. Each time that 
a mix was made by any recipe, enough batter was prepared for 6 cakes, 
each of which was baked at a different temperature. Thus the recipes 
are the “whole-uniF' treatments, while the baking temperatures are the 
^'sub-unit'' treatments. There were 15 replications, and it will be as- 
sumed that these were conducted serially according to a randomized 
blocks scheme: that is, one replication was completed before starting 
the next, so that differences among replicates represent time differences. 
(The notes suggest that this was done, though they are nob quite ex- 
plicit.) 

A number of measurements were made on the cakes. The measure- 
ment presented here is the breaking angle. One half of a slab of caJee is 
held fixed, while the other half is pivoted about the middle until breakage 
occurs. The angle through which the moving half has revolved is read 
on a circular scale. Since breakage is gradual, the reading tends to have 
a subjective element. The data are shovm in table 7.6. 

It is customary to compute the analysis of variance on a sub-unit 
basis. To avoid confusion, this should be clearly stated in the analysis 
of variance table itself. The calculations may be presented in three 
steps. 

Sic'p 1. Analyze the whole-unit totals by the method appropriate to 
the design in which they are arranged. 

(8673)2 

- 278,596 


Correction factor; • 


Total: 


270 

(269)2 + (260)2 +. 


^ -f (165)2 


278,596 


11,538 


Replications: 


(843)2 + (820)2 +...+ (479)2 


18 


278,696 - 10,204 


Recipes: 


(2981)2 + (2848)2 + (2844)2 
90 


278,596 


Error (a), by subtraction: 11,538 - 10,204 - 135 


135 

1,199 


7,17 


NUMERICAL EXAMPLE 


225 


Recipe I 


Totals 


Pt.ecipe II 


Totals 


Recipe III 


TABLE 7.5 Breaking angles (degrees) 


Rep. 

175" 

’185" 

Temperature 

195" 205" 

215" 

225" 

Unit 

totals 

1 

42 

46 

47 

39 

53 

42 

269 

2 

47 

29 

35 

47 

57 

45 

260 

3 

32 

32 

37 

43 

45 

■ ’45 ■ 

234 

4 

20 

32 

35 

24 - 

39 

26 

1S2 

5 

28 

30 

31 

37 

41 

47 

214 

6 

24 

22 ■ 

22 

29 

35 

26) 

158 

7 

26 

23 

25 

27 

33 

35 

160 

8 

24 

33 

23 

32 

31 

34 

177 

9 

24 

27 , 

28 

33 

34 

23 

160 

10 

24 

33 

27 

31 

30 

33 

17S 

11 

83 

39 

33 

28 

33 

30 

196 

12 

28 

31 

27 

39 

35 

43 

203 

13 

29 

28 

31 

29 

37 

33 

187 

14 

24 

40 

29 

40 

40 

31 

204 

15 

20 

28 

32 

25 

37 

33 

181 


437 

473 

462 

503 

*580 

526 

2981 

1 

39 

46 

51 

49 

55 

42 

282 

2 

35 

46 

47 

39 

52 

61 

280 

a 

34 

30 

42 

35 

42 

35 

218 

4 

25 

26 

28 

46 

37 

37 

199 

5 

31 

30 

29 

35 

40 

30 

201 

6 

24 

29 

29 

29 

24 

35 

170 

7 

22 

25 

26 

26 

29 

36 

164 

8 

26 

23 

24 

31 

27 

37 

168 

9 

■ 27 

26 

32 

28 

32 

33 

178 

10 

21 

:/'.24:; : 

24 

27 

37 

30 

163 

11 

20 

27 

33 

31 

28 

33 

172 

12 

23 

28 

31 

34 

31 

29 

176 

13 

32 

35 

30 

27 

35 

30 

189 

14 

23 

25 

22 

19 

21 

35 

145 

. 15 

21 

, 21 

28 

26 

27 

20 

143 


403 

441 

476 


517 

529 

2S48 


46 

' 44'- 

: 45 


48 


292 

2 

43 

:':'43'V 

43 

46 

47 

58 

280 

3 

33 

24 

40 

37 

41 

38 

213 

4 

38 

41 

38 

30 

36 

35 

218 

5 

21 

25 

31 

35 

33 

23 

168 

0 

24 

33 

30 

30 

37 

35 

189 

7 

20 

21 

31 

24 

30 

33 

159 

8 

24 

23 

21 

24 

21 

35 

148 

9 

. ■■■ 24 :''- 

18 

21 

26 

28 


145 

10 

26 

28 

27 

27 

35 

35 

178 

11 

28 

25 

20 

25 

38 

28 

170 


24 

30 

28 

35 

33 

28 

178 

.:t3, ; 

28 

29 

43 

28 

33 

37 

198 

14 

19 


27 

25 

25 

35 

153 

15 

21 

28 

y: 25;::^'' 

25 

31 


155 


419 

434 

476 

463 

516 

536 

2844 


1259 

11348;;:;;:: 


1448 

ilp®; 




8073 


Totals 

Temp, totals 
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Step 2. This concerns the sub-unit treatments. Their main effects are 
obtained directly. 

(1259)2 -I- (1348)2 + . . . + (1591)2 


Temperatures : ■ 


45 


278,596 = 2100 


The sum of squares for interactions between sub-unit and whole-unit 
treatments is found by subtraction. First calculate the total s.s. for the 
two-way table that shows both sets of treatments. 


Total treatments: 

(437)2 -f (473)2 (510)2 ^ ( 530)2 

15 


278,596 = 2441 


Then, 

Recipes X temperatures: 2441 — 135 — 2100 = 206 
Step 8. Compute the total s.s. among all sub-units. 

Total: (42)2 + ( 47)2 ( 35)2 ^ ( 25)2 _ 278 ^ 59(5 = 13^143 

The sum of sauai’es for error ( 6 ) is then found by subtraction in table 
7.6. 

TABLE 7.6 Analysis of vaei-anoe of bbeaking angles (on a sub-tjnit 
basis) for the expbehAent on chocolate cakes 


Source of variation 

d.f. 

s.s. 

m.s. 

F 

Replications 

14 

10,204 



Recipes 

2 

135 

67.5 

1.58 

Error (a) 

28 

1,199 

42.8 


Temperatures 

6 

2 , 100 ^ 

420.0 

20.49 

Linear regression 

1 

^1,967 

1,967.0 

95.95 

Deviations 

4 

133 

33.2 

1.62 

Recipes X temperatures 

10 

206v 

20.6 

1.00 

Error (i>) 

210 

4,299 

20.5 


Total 

269 

\ 

1 cc 

{ 

00 

1 




The mean square for recipes, while above that for error (a), does not 
approach the 5% level. Temperature effects are highly significant as 
compared with error (5). Since there is a fairly steady increase in the 
breaking angle with increased temperature, the siun of squares has been 
divided into the component due to a linear regression on temperature 
and that due to deviations from the straight line. The regi-ession coeffi- 
cient amounts to an increase of 1 . 6 ° in the breaking angle for each 10 ° 
rise in baking temperature. The mean square for deviations is not sig- 
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nificaiit, thoiigli it is higher than expectation; There i>s no indication of 
any interaction. It will be observed that the error (a) m.s. is about twice 
as large as that for eiTor (5). 

In table 7.7 are shown the treatment means, with the principal stand- 
ard errors as obtained from table 7.4. 

TABLE 7.7 Bee AK iNG ANGLE MEANS (dEGEEES) 


Bt!ciX)e 


Recipe 

175° 

185° 

195° 

205° 

215° 

225° 

means 

I 

29.1 

31.5 

30.8 

33.5 

38.7 

35.1 

33.1 

II 

26.9 

29.4 

31.7 

32.1 

34.6 

35.3 

31.6 

Hi 

27.9 

28.9 

31.7 

30.9 

34.4 

35.7 

31,6 

Temp, means 

28.0 

30.0 

31.4 

32.2 

35.9 

35.4 

32.1 


Standard error of difference between 
Two recipe means: 

Two temperature means: 

Two temperature means for one recipe: 

Two recipe means for a given temperature: 


12 ( 42 , 8 ) 
Y 90 




2(20.5) 

45 


12 ( 20 . 5 ) 

y-jT- 


= O.iJS (28 d.f.) 
= 0.95 (210 d.f.) 
= 1.66 (210 d.f.) 




2[5(20.5) + 42.S] 
;»0 : 


l.SO 


Tlie 5% levels of t are 2.05 and 1.97, respectively, for 28 and 210 d.f. 
Consequently, the 5% level for the last standard error above is 

(5) (20.5) (1.97) + (42.8) (2.05) ^ ^ 

(5) (20.5) + (42.8) 

This value always lies between the two individual f-valucs. In practice 
it need rarely be calculated. 


7.18 Missing Data. The formulae for hiserting estimates of missing 
values have been developed by Anderson (7.3). Suppose first that the 
observation for a single sul>unit is missing, and that this sub-imit re- 
ceives the treatment combination (fflihj). Let 

U = total for the unit containing the missing observations. 

{AiBj) = total of all sub-rmits that receive the treatment combination 

(oA)- 

(.di) = total of all observations that receive the fth level of A. 
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Then the estimate to be inserted for the missing value is 

- Ui) 

y - (r - l)(/3 - 1) 

Example. Suppose that the observation for recipe II, 195° temper- 
ature, in replication 2 is missing. (Its actual value is 47.) Then 

17 = 280 - 47 = 233; (4iBy) = 476 - 47 = 429; 

(4f) = 2848 - 47 = 2801 

15(233) 4- 6(429) - 2801 3268 

^ (14)(5) 70 

By accident, the estimated value is the same as the actual value. If 
several observations are missing, repeated use is made of the formula as 
described in section 4.25. 

In the analysis of variance 1 di. is subtracted from the error (b) di. 
for each missing observation. An unbiased estimeAe of Ei, is obtained, 
but the treatment sums of squares and that for error (a) are biased up- 
wards. If only a small fraction of the values is missing, it appears that 
these biases can be ignored; methods for obtaining unbiased estimates are 
given by Anderson (7.3), 

Standard erroi's, l^Tien there are missing observations, the formulae 
in table 7.8 are suggested for the differences between 2 means. 

TABLE 7.8 Standard errors for split-plot experiments with 

■■■■MISSINa DATA:' 

Treatnient comparison s.e. 

Difference between two A means: V2[^a~+/^6]/ri!3 

Difference between two B means: \^2Eh[l -h (fi/ci)]/roi 

Difference between two B means at the same 
level of A : " +iWM]/r 

Difference between two A means at the same 
level of B: - 1} +W]/r0 

If only one value is missing in the experiment, and if a mean containing 
that value is compared with another mean, the factor /is -Kr ~ 1)0 — 1), 
and the formulae are exact, reference (7.3). 

With more than 1 missing observation, the value of / depends on the 
locations of the missing observations. The following approximation. 
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(levelopecl by G. S. Watson, is correct for a number of cases but other- 
\vise tends to be slightly too high. 

/.— ? 

2^r--d){p-k + G-l) 

When counting the values of k, c, and d, as defined below, be sure to 
ignore all missing observations except those that occur in the 2 means 
that are being compared. 

k = number of missing observations. 

c ~ number of replications which contain 1 or more missing observa- 
tions. 

d — number of missing observations in the sub-unit treatment {(lihj) 
that is most affected. 

For example, suppose that in the cake experiment the values for recipe 
T, 175° temperature, are missing in replications 1, 2, and 3, while those 
for recipe I, 185° temperature, are missing in replications 3 and 4. We 
are comparing the means for the 2 temperatures, taken over all recipes. 
Then k is 5, and c is 4. The sub-unit treatment with the most missing 
values is recipe I, 175° temperature, which has 3 missing, so that d is 3. 

The case where a complete unit is missing is discussed by Anderson 
(7.3). 

7,2 Repeated Subdivision 

7.21 Description, In order to include a new factor C at y levels, 
the sub-units may each be divided into sub-sub-units. There are three 
experimental error variances. Errors (a) and (h) have the same functions 
as described previously and are calculated in the same manner, except 
that an extra dmsor y is introduced into all sums of squares in order to 
present the analysis on a sub-sub-unit basis. Error (c), which will usually 
be the smallest of the three, applies to the C, AC, .BC, and ABC effects. 
The process of subdivision may be carried as far as is convenient. 

The partition of degrees of freedom has already been given (table 7.3) 
for factors A and B. For C and its interactions, the partition is shown in 
table 7.9. 

TABIjE 7,0 Analysis of variance foe C and its interactions 


Effect 

d.f. 

C 

(t — 1) 

AC 

(« - 1)(t - 1) 

BC 

- 1)(7 - 1) 

ABC 

{« - l)(^i - my - 1) 

Error (c) 

aKr - 1)(7 - 1) 


i 

I 
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The computations in the analysis of variance are a straightforward 
extension of those for the simple split-plot design. The sum of squares 
for ylC, for instance, is obtained by calculating the total s.s. for the A 
by C two-way table, and subtracting the sums of squares due to .4 and 
C] and similarlj'’ for the other factorial effects in table 7.9. Finally, the 
sum of squares for error (c) is found by calculating the total s.s. among 
sub-sub-units, and subtracting the contributions from all other items in 
the analysis. Worked e.xamples are given in references (7.4) and (7.5). 

7.22 Standard Errors. The formulae in table 7.4 for standard errors 
applicable to .4 and J5 effects remain valid apart from division by an ad- 
ditional factor Vt- For the principal comparisons which invoke C ef- 
fects, standard errors are shown in table 7.10. All treatment means and 
error mean squares are assumed to be on a sub-sub-unit basis. 

TABLE 7.10 St,«.ndakd ehbobs bob the split-plot design with two 

SUBDIVISIONS 


Treatment comparison 

[(c9 - W! 

[(lEl) - (oico)l 

[(fllfil) - (ooci)] 

[ihci) - (&lCo)l 

K'nci) - (&oci)l 


4 


7'a^ 


hEc 
\ r/3 

p(7 - 1)^. + ^] 

IWa 

\ m 

^ 2[(t + 

, /V; , ■■ f<xy 


7.3 Some Variants of the split-plot Design 

7.31 Systematic Arrangement of the Treatments AppKed to the Units. 
The device of subdividing each unit into a number of sub-uuits is very 
flexible, and can be used with any type of design in whicli the units are 
arranged. Some of the more useful variations that are possitfle are de- 
scribed in the remainder of this chapter. The first arises when the nature 
of the experiment makes it necessary or desirable to have the A. treat- 
ments arranged in a systematic design. This arrangement has been 
used in experiments where the A treatments were varieties of wheat 
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known to mature at different dates/ and the B temtments were prepara- 
tions applied to the seed before sowing. Since each variety was to be 
harvested when ripe, the field operations were made easier by planting 
the varieties in the order in which they would be ready for harvest. The 
field plan is iiiustrated in table 7.11, where Ui is the earliest variety and 
as the latest. 

TABLE 7.11 Example OF BYSTEM ATIC ABHANGEMENT OF TBEATMENTS 


APPLIED TO UNITS 

at ad ar, 



% (12 ^4 Gt6 


III 

The sub-unit treatments are randomized within each unit; the unit | 

? - treatments, however, appear in separate strips. 

This arrangement provides no valid estimate of error for the A main 
effects, or for comparisons such as (aibo — agbo) that involve A effects. 

Thus the ^hvhole-plot^’ analysis of variance is irrelevant. In the sub-plot 
] analysis of variance, error (b) is still valid for testing the B effects and 

the AB interactions. One point should be watched. A comparison such 
as (ai5i — aibo) is derived entirely from the first strip in the plan. Con- 
f sequently, if the sub-plot error variance differs from strip to strip, the 

use of the error (b) m.s. for testing this quantity is not justified; instead, 
it is necessary to compute a separate error using only data from the first 
strip. 

: To summarize, systematic placement of the A treatments is advisable 

! only where it is essential in order to conduct the experiment and where 

i no test of the A main effects is required. 

y 7.32 Sub-unit Treatments in Strips. In a further variant the sub- 

unit treatments, instead of being randomized independently within each 
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unit, are arranged in strips across each replication. For a 5 X 3 design 
the appropriate rearrangement (after randomization) might be as shown 
in table 7.12. 

This layout may be convenient for field experiments where it is neces- 
sary to test both factors on relatively large areas and to leave free access 


TABLE 7.12 Design WITH sub-unit treatments in strips 
■ Rep. I ' ■ ■ . , 

as Cl C2 Co 


h 

bo 

h 


ill 


Rep. II 

04 ao C2 


cs 


hi 

h 

h 


ai 


Hop. Ill 

as Co 02 


a4 


h 

ho 


Analysis of variance 
d.f. 

Replications 


A 

Error (a) 
B 

Error (h) 
AB 

Error (c) 
Total 


2 

4 

5 
2 
4 
8 

R) 

44 


at both ends, Similar conditions may apply in other types of experi- 
mentation. 

As with the ordinary split-plot design, the average precision over all 
treatment comparisons is the same as that of randomized blocks arranged 
on the sub-units. The present design sacrifices precision on the main ef- 
fects of A and B in order to provide higher precision on the interactions, 
which will generally be more accurately determined than in either ran- 
domized blocks or the simple split-plot design. Since in addition the 
numbers of degrees of freedom for estimating the errors of the A and B 
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effects are likely to be small, the design is not recommended unless prac- 
tical considerations necessitate its use or unless the interactions are the 
principal object of study. Both the A and B treatments should be ran- 
domized independently within each replication. 

TABLE 7.13 Latin square design with sub-unit treatments in strips 

(l2 ' CL4 . «'3 .. ■ ■ ■ 

bi 


bo 

b 2 


Ui as ai 02 oq 


bi 

bo 

6‘2 

b 2 

hi 

bo 

hi 

bt 

bo 


ao az ai a\ 


ai Oq 04 03 02 


AnaiysiB of variance 


Rows 

a.f. 

4 

Columns 

4 

A 

4 

.Error (a) 

12 

B 

2 

Error {h) 

8 

AB 

8 

Error (<?) 

32 

Tot-al 

',",^74 


bo 

h 

bi 


«8 


04 02 ao fll 
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In the statistical aiial.ysiB, separate estimates of error are obtained for 
A, J5, and AB, Both sets of main effects may be regarded as tested in 
randomized block designs, in which the sum of squares for replications 
(blocks) is the same for both designs. A worked example (on uniformity 
data) is given in reference (7.6), 

Formulae for the estimated standard errors of the differences between 
treatment means per sub-unit are given in table 7.14. The mean squares 
Ea, Eh Ec are assumed to be on a sub-unit basis. 

TABLE 7.14 Stand ahi> erroks when sub-unit treatments are in strips 
{a = nimiber of levels of A; (3 - number of levels of B; r ~ number of replicates) 
Treatment comparison s.e. 


m -- (ao)] 

Kh) - (Wl 
lidih) - (ao?->i)l 
[(aihi) - (aiho)] 


V2Ea/r(3 


V 2Eh/m 


V2[(a - l)Bc + EaVra 


One of the factors may be arranged in a latin square. With a 5 X 3 
design in 5 replicates, a layout of this type is shown in table 7.13. 

The statistical analysis is straightforward. For the B treatments, the 
“blocks^^ of the randomized blocks design are the rows of tlie latin square. 
The formulae given above for the standard errors are applicable (with r 
equal to a). 

7.33 Sub-unit Treatments in a Latin Squai'e. In certain cases the 
sub-unit treatments may be arranged in a latin square, with the prospect 
of a further increase in accuracy on sub-unit comparisons. There are 
two types of layout, of which the first will be described in this section; 
the second, which is used when there are only a few whole-unit treat- 
ments whose main effects are not required wdth high accuracy, is dis- 
cussed in section 8.6. 

In the first type the number of replications must be a multiple of the 
number of sub-unit treatments. For each ■whole-unit treatment, the 
sub-unit treatments are arranged in one or more separate latin squares. 
An example with 4 whole-unit treatments, 3 sub-unit treatments, and 3 
replications is shown in table 7.15. The latin squares are not very clearly 
displayed in the plan. If we examine the three units that receive as, it is 
seen that each b appears once in the top sub-unit, once in the middle, 
and once at the bottom. Thus the rows of the latin square are the posi- 
tions of the sub-units within the unit. The only change from the ordi- 
nary split-plot arrangement is that differences among these rows are 
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i 

1: 


I' 


eliminated from the sub-imit errors. If these differences are large, sub- 
unit comparisons are more precisely determined than in the ordinary de- 
sign. Wliole-unit comparisons are unaffected. 

The designs are useful where a fairly consistent gradient is expected 
within each unit. Examples have been quoted previously. In experi- 


TABLE 7.15 Split-plot design with sttb-treatmlnts in a latin square 

ag <I2 Cli Co 

Analysis of variance 
Units (i.f. 

Eeplieations 2 

A 3 

Error (a) 6 

Total 11 

Sub-units 

B.OWS . 8 ■■■ 

B 2 

AB 6 

Error (h) 8 

Total;;' 


ments on plant viruses, the sub-units may be the top, middle, and lower 
leaves of a plant, with a regular gradient of susceptibility down the 
plant. The sub-unit treatments may be operations performed in suc- 
cession by the same person, or drugs injected in succession to the same 
animal, where a time-trend is anticipated. 

The number of replicates may be any multiple of the number of sub- 
unit treatments. With 6 replicates of the design in table 7.15, the ex- 
perimental material is divided into 2 groups of 3 replicates each. If 
practicable, the division should be made in such a way that the within- 
miit gradient is constant in a given group. Since the row%s are eliminated 
separately for each group, changes in the gradient from group to group 
do not increase the error. 

In the randomization the first step is to randomize the units according 
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to the instructions for the design in whicli they are arranged. For each 
whole-unit treatment a latin square of the size required to accommodate 
the sub-treatments is then randomized as described in section 4.33. A 
new randomization of the latin squai’e is taken for each successive whole- 
unit treatment. The first column of each square is assigned to the first 
replication of the experiment, and so on. 

For extra replication the basic plan is repeated with a new randomiza- 
tion. 

Since the whole-unit analysis of variance is imaffected, it is necessary 
to show only the subdivision of the degrees of freedom among sub-units. 
Table 7.16 applies to the case in which the basic plan is repeated i times. 


TABLE 7.16 Partition of degrees of freedom among sub-units when 

SUB-TREATMENTS FORM BATIN SQUARES 


•giSi 

H! 


Hows 

Sub-unit treatments 

Sub-unit X wliole-unit treatments 

Error (5) 


di. 

ka(§ — 1 ) 

O—l) 

(a - 1)0 -- 1) 

- l)(r - 1 - Ai) 

m(/3 — 1) 


Notation: a ^ number of whole-unit treatments. 

/3 s= number of sub-unit treatments, 
r number of replications « /c/3. 

The number of degrees of freedom for rows should be noted. Since 
each latin squa-re supplies (jS — 1) degrees of freedom for rows, every 
whole-unit treatment contributes fc(/3 — 1) degrees of freedom, so that 
the total is ka(l3 — 1) degrees of freedom. The siun of squares for rows 
is calculated by addition of the sums of squares for each of the ka latin 
squares. The sum of squares among the columns of the latin squares 
constitutes a whole-unit comparison and does not appear in the portion 
of the analysis shown above. 

Before using this design it is advisable to verify that the number of 
degrees of freedom for error (h) is adequate. In the example in table 7.15 
there are only 8 d.f., as against 16 with an ordinary split-plot arrange- 
ment, From section 2.31 it appears that the sub-unit error variance 
must be reduced by at least 15% in order to compensate for the decrease 
in degrees of freedom. 

7,34 Split-plot Technique with Confounded Designs. The treat- 
ments applied to the units may constitute a factorial system arranged in 
a design that confounds certain interactions completely or partially. 
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Since experimenters are sometimes uncertain how to compute the analysis 
of variance in such cases, the following notes indicate the method. 

1. The totals for each unit are anatyzed by the instructions foi the 
confoimded design in which the units are arranged. 

2. In computing the sums of squares for the sub-unit treatments and 
for all interactions between sub-unit and whole-unit treatments, the 
confounding is ignored. Even if a whole-unit factorial effect is com- 
pletely confoimded, its interactions with sub-unit treatments are un- 
conf oiinded and may be tested by means of error (6) . 

3. The sum of squares for error (6) is found as usual by sul^tracting the 
total s.s. on the whole units plus the total s.s. for sub-unit treatments 
and for interactions between sub-unit and whole-unit treatments from 
the total s.s. for sub-units. 

A difRculty arises occasionally in the presentation of tables. Suppose 
that a 3X3X2 design is applied to the imits in blocks of 0 units, with 
AS and ABC partially confoimded, and that a factor D at 2 levels is ap- 
plied to the sub-units. In the A by B two-way table, the entries must be 
adjusted to eliminate block effects. The problem is how to present the 
ABD three-way table. The observed means (aobod[)) and (ciohodi) will 
a,verage to the uncud justed mean for (ao^o)? which contains block effects. 
The simplest process is to calculate the difference between the adjusted 
and the unadjusted means of (aoho). This difference is then applied to 
(dobodo) and (aQbQdi). Notice that this adjustment does not change the 
difference between these two entries and brings their mean to the correct 
value. 

7.35 Confounding of Comparisons among Sub-unit Treatments. 
When the sub-unit treatments are factorial, it may he advantageous to 
confound certain interactions among these factors with the units. The 
technique, first presented by Yates (7.7), is illustrated in table 7.17, 
which shows a single replication of the design. The factor A applied to 
the units has 2 levels, while the sub-unit treatments are the 8 combina- 
tions of factors C, and D at 2 levels each. The ordinary split-plot ar- 
rangement is shown in (i), while (ii) and (iii) are two alternatives for 
the new method. 

Consider first (i) and (ii). In the ordinary arrangement the unit con- 
tains S sub-units, to which the 8 sub-treatments are allotted at random. 
With the new arrangement the sub-units are in groups of 4, chosen so 
that BCD is confounded with group totals. The advantage of the new 
layout is that the we of the wbole-imit has been reduced from 8 to 4 sub- 
units. This will in general lead to more accurate estimates both of the 
A effects and of the sub-imit treatments. The only exceptions are the. 
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high-order interactions BCD mdABCD. In the ordinary arrangement 
these interactions are derived from comparisons among sub-units in the 
same imit, whereas in the new design they are estimated from com- 
parisons among the imits, as the reader may verify. 

Arrangement (iii) goes a step further. The oo and ai treatments which 
have opposite sets of sub-unit treatments are placed in the same sub- 

TABLE 7.17 Confoundino of BCD in a split-plot experiment 
i. Ordinary arrangement ii. Now arrangement 

flo Qi ^0 ^'0 fti 



bed 

b 

d 

(!) 

bd 

he 

c 

cd 


bd 

d 

c 

cd 

bed 

b 

W' 

he 


iii. New arrangement in sub-blocks 
Block la Block 16 

ao ai an o>i 



block. The result is that ABCD is completely confounded between sub- 
blocks, while BCD^ like ..4, is orthogonal with sub-blocks. The aim is to 
sacrifice information on ABCD in the hope of obtaining more precise 
estimates of A and BCD, Consequentl}^ alternative (iii) is usually 
preferable to (ii). 

With 5 replications of arrangement (iii), the partition of degrees of 


freedom is set out in table 7.18 in condensed form. There are 80 sub- 

units and 10 sub-blocks. 




TABLE 7.18 Paetition 

OF DBGKBBS 

OF PRFKDOM IN A 2 X 2^ BEBIGN 

WITH 


SPLIT-PLOT CONI'OUNDINU 


Units 

d.f. 

Bub-units 

d.f. 

Between sub-bloclcs 

9 

B, 0, D, BC, BD, CD 

6 

A 

1 

AB, AC, AD, ABC, ABD, ACD 

6 

BCD 

1 

Error (6) 

48 

Error (a) 

8 


— 


Total between units 19 


Grand total between sub-units 79 
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All sums of squares are calculated in the usual way. Note that ABCD 
does not appear, since it is a component of the sum of squares between 
sub-blocks. 

■\^Th.en the sub-unit treatments form a 2^ system, a similar technique 
may be used for any number of levels of the factor A. Single replications 
of the layouts are shown in table 7.19 for A at 2, 3, 4, 5, and 6 levels. 

TABLE 7.19 Split-plot design pok an ck X 2® factorial system 
Arrangement of whole-units in a single replication 
«>= 2;(2 X 2^) « :== 3;(3 X 2^) 

L 


a - 4;(4: X 2») 


' a - 5;(5 X2^) , . ,| 



Each unit contains 4 sub-units. A iirime (') implies that the sub-unit treatments 
are (1), he, hd, and cd, ami an unaccented a that they are b, c, d, and hexl. 


The arrangements differ slightly according as the number of levels of I 

A is even or odd. With an even level, one component of ABCD is con- I 

founded between sub-blocks. For example, with A at 4 levels, this ! 

component is (a-s + ai - a 2 — a-o) BCD, When A has an odd number I 

of levels, we are forced to confound BCD at least partially with sub- j 

blocks, and for simplicity it has been completely confounded. . | 

The randomization for a single replicate is as follo^vs, (i) Allot the 
sub-blocks a and b at random to the 2 halves into which the replicate has | 

been divided, (ii) Allot the levels of A at random to units within each || 

sub-block, (iii) Allot the appropriate set of 4 sub-treatments at random , , A I 
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within ea(Ji unit. A separate randomisation is used in each replication. 

The partition of degrees of freedom in the analysis of variance is shown 
in table 7.20 for r replications of the basic design. 

TABLE 7.20 Partition of degrees of freedom for an a' X 2® bplit-plot 



DESIGN 





Units 

Sub-units 




a even d.f. « odd d.f. 


d.f. 


Bub-blocks 

(2r-l) (2r - 1) 

Sub-treatments 

6 


4 

(a — 1) (a — 1) 

A X sub-treatments 

6(a — 

1) 

BCD 

1 

Error (6) 

6a'(r — 

1) 

ABCD 

{a - 2) (o; - 1) 

Total 

6a:r 


Error (a) 

\ 

1 

1 

1 




Total 

(2ca- - 1) {2ar - 1) 





When o; is odd, the sum of squares for BCD is omitted from the treat- 
ments; otherwise all sums of squares are obtained in the usual way. 
With a even, 1 di. is missing from ABCD, since it is confounded with 
sub-blocks. Since these designs are unlikely to be used unless ABCD is 
small, it will usually be satisfactory to combine the ABCD sum of 
squares with error (a). The pooled sum of squares can be found by sub- 
traction without calculating that for ABCD. 

A considerable number of arrangements of this type are possible, 
though some care is required in their construction in order to avoid 
confounding important comparisons. Finney (7.8) gives an account of 
the principles of construction and designs mth worked examples, for a 
4X2^ and a 6 X 2^ factorial, where the units are in a 4 X 4 and 6 X 6 
latin square, respectively. Yates (7,7) gives a similar design for 6 X 2‘^ 
factorial. 
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CHAPTER 8 

FACTORIAL EXPERIMENTS CONFOUNDED IN QUASI-LATIN 

SQUARES 

8.1 Introduction 

Yates (8.1) has constructed a number of confounded arrangements for 
factorial designs in latin squares. In the first group of designs (plans 
8. 1-8.7), some of the interactions among the factors are confounded with 
the rows and columns of the squares. These designs resemble ordinary 
latin squares in that the differences among the rows and columns of the 
squares are eliminated from the experimental errors of the uncoiifounded 
treatment effects; however, the designs do not possess the typical latin 
square property that each treatment appears once in every row and every 
column. Yates (8.1) has proposed the name quasi4aim squares. 

Before one of these plans is used, it is advisable to note the interactions 
that are confounded and the extent to which they are eonfoimded, par- 
ticularly since more treatment effects must be confounded with latin 
squares than with randomized blocks. Of the eight plans shown, only 
one involves the confounding of two-factor interactions. 

In the statistical analysis, the sums of squares for rows, columns, and 
all unconfounded treatment effects are calculated in the usual way. 
Partially confounded effects, if considered negligible, may be combined 
with the error, which is found by subtraction; or they may be tested for 
significance by calculating their sums of squares from those portions of 
the experiment in which the effects are unconfounded. 

In a further group of designs (plans 8,8-8.14), certain treatments are 
applied to complete rows or columns of a latin square. These designs 
have been called half-plaid squares or plaid squares, from the resemblance 
of the plan to a Scotch plaid. 

8,2 Randomization of Quasi-latin Squares 

The rows and the columns of each latin square are rearranged at ran- 
dom. In certain plans (e.g., 8.2) each square contains several replica- 
tions. The randomization must not be restricted in such cases with the 
object of keeping the replications separate. . , =, ■ 
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QUASI-LATIN SQUARES 
8.3 Notes on the Plans and Statistical Analysis 


8.3 


signs, of which two are given in plan 8.1a and 8.15. In the former, who 
^ j. vAnlifitions Vo relative information is retained on AB, At, 

iir'^Ld ABC, main effects being unconfounded. On accoimt of the high 
tofoF confounding of the two-factor interactions, this plan is not 
recommended unless the factors are presumed to act independently oi 
imless the 4 X 4 square is considered much more accurate than a ei- 

example illustrates the general method of analysis for aU 
deigns presented here. The data in table 8.1 represent twnce the yields 

TABLE 8.1 Yields ano statistical analysis or a 2^ factubial in two 
4X4 quasi-latin sqxjakbs 


Square I 

ABC AC 

0 (1) abc ah Totals 

57.9 02.3 60.2 68.6 239.0 

a at, b be 
66.2 60.5 60.0 53.4 240.1 


Square II 
AB BC 


AS 


c a abe b Totals 

62.4 64.6 68.5 63.1 258.6 

(1) at be ah 

61.5 62.7 68.6 69.3 252.1 


ABC 


abc be (1) a, 

04.1 61.3 63.2 69.2 247.8 

h ah ac c 

58.4 61.8 67.0 61.9 240.1 


BC 


abc b (1) 

66.0 66.5 63.5 67,3 253,3 

ah he a c 

65.9 62.6 68.1 58.1 234.7 


AO 


Totals 246.6 245.0 250.4 233.1 976.0 


Squares 
Rows 
Columns 
A R n 


d.f, 

1 

6 

6 


255.8 236.4 248 7 257.8 998.7 

S.S. in.s. 

16.10 
100.90 
112.71 

1AK 4.3 


16.82 

18.78 
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in bushels per acre from a soibtreatment test on soj^teans, at Muscatine, 
Iowa. The plots were split for a varietal comparison which has been 
omitted. For convenience, rows and columns were rearranged so that 
the field plan corresponds to plan 8.1a. The factors were limestone (A), 
phosphorus (jS)j and potash (C) ; in each case the lo wer level implies no 
application of the treatment. 

To obtain the sums, of squares for the treatment effects, first calculate 
the factorial effect totals in the usual way. These are shown in the first 
line of table 8.2. For the partially confounded effects, these totals must 


TABLE 8.2 Treatment effects for a 2^' pactoriau in 4 X 4 

QITASI-LATIN SQUARES 



lA] 

[B] 

ICl 

lAB] 

[AC] 

[BC] 

[ABC] 

Totals 

+66.3 

-18.1 

-9.7 

+35.7 

+34.9 

- 8,1 

- 4.3 

From rows and columns 




+ 18.3 

+35.9 

- 10.4 

+ 7.2 

Adjusted totals 




+17.4 

- 1.0 

+ 2.3 

-11.5 


be adjusted. Now AJ5 is completely confounded with the first 2 rows of 
square I and the first 2 columns of square II. Calculate AB as obtained 
from these rows and columns, i.e., 

239.0 - 240.1 + 255.8 - 236.4 = +18.3 


This figure is placed in the second row of table 8.2, with corresponding 
figurCvS for AC, BC, and ABC, The adjusted totals are found by sub- 
tracting the second from the first row. Of course, these represent Just 
the totals for each effect as found from that part of the experiment in 
which the effect is unconfoundcd. The divisors for the squares are 32 
for main effects and 16 for the other effects. 

The two-way tables of means require adjustment. For the AB table 
the adjustment is 


(+17.4) (+35.7) 

^'"i6 32 


--0.03 


This value is added (algebraically) to the (1) and {ah) means and sub- 
tracted from the (a) and (b) means. 

Plan 8.12) completely confounds ABC with squares; all other effects 
are unconfounded. The arrangement has the peculiarity that half the 
treatments appear only in square I, while the other half appear only in 
square IL Although this layout avoids the confounding of any two- 
factor interactions, Yates has pointed out that any interaction of the A 
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effect with squares will appear in the analysis as a BC interaction. The 
results are therefore likely to be misleading if the experimental material 
in square I differs in responsiveness from that in square II. 'WTiere this 
danger seems likely, Yates suggests that the colmnns (or rows) of the 
two squares may be interlaced, so that each square comprizes a ran- 
domly chosen half of the experimental material. A numerical example 
of the statistical analysis is given by Yates (8.1, p. 33), using uniformity 
data. 

2* factorial in an 8X 8 square. Four replications are required for plan 
8.2, ABCD is completely confounded, and threefold implication is ob- 
tained on all three-factor interactions. The degrees of freedom sub- 
divide as follows. 

d.f. 


Rows 

Columns 7 

Treatments 14 

Error 35 


In the computation for ABC, ABB, ACB, or BCB, the appropriate pair 
of rows is deleted. 

factorial in an 8X8 square. The confounding in plan 8.3 (for 2 
replications) is not completely balanced. Eight of the 10 three-factor 
interactions and four of the 5 four-factor interactions receive only single 
replication. The remaining high-order interactions, ABB, CDE^ ABBB, 
and ABCBB, are unconfoimded. By a proper assignment of the letters 
a, • • • to the factors, any single three-factor interaction may be left 
unconfounded, though not any desired pair of interactions. 

The partition of degrees of freedom is: 


Rows 

d.f. 

7 

Columns 

7 

Treatments 

31 

Error 

18 


In the computation of the sum of squares for ABC, ABB, or BCBB, the 
4 rows in the experiment which correspond to the first 4 rows of plan 8.3 
are omitted. The 4 rows or columns to be omitted for any other partially 
confounded comparison are found similarly from plan 8.3, If the high- 
order interaction effects are likely to be negligible, all partially con- 
founded effects and ABCD and ABODE may be combined with the 
error. This procedure gives 17 di. for treatments and 32 for error. 


8.3 


NOTES ON PLANS AND STATISTICAL ANALYSIS 


24-5 


fackn-id in an 8 X 8 square. In each square of plan 8.4, 8 three- 
factor and 6 four-factor interactions are completely confounded. If the 
experiment has a single replication and if only the main effects and two- 
factor interactions are isolated for tests of significance, the degrees of 
freedom are allotted as follows. 

d.f. 

Bows 7 , 

Columns 7 

Main effects ; 

Two-factor interactions 15 

Error (estimated from liigh-order interactioiis) 28 

Any imconfoiinded three-factor interaction may also be tested. 

When there are several replications in the experiment, the squares 
should be chosen so that the most important three-factor interactions 
are free from confounding. The balanced design, which requires 5 repli- 
cations, gives 3/5 relative information on all three- and four-factor inter- 
actions. 

3'^ factorial in a 9 X 9 square. Three or six replicates may be msed 
(plan 8.5). Four of the 8 di. for ABC are completely confounded in each 
square. The 2 squares together give balanced confounding of ABC^ with 
1/2 relative information. 

When square I alone is used, the components of the analysis of vari- 
ance are: 


Rows 

d.f. 

8 

Columns 

8 

A, B, C 

6 

AB, AC, BO 

12 

ABC 

4 

Error 

42 


The’ 4 di. for ABC are evaluated as follows. Compute the total yiehls 
of the 3 groups of 9 treatments which are described as set il (a), (c), and 
(5) in square 2. The sum of squares of deviations of these 3 totals, with 
divisor 27, constitutes 2 of the 4 d.f. The remaining 2 are found similarly 
from set IV (a), (5), and (c) in square 2. 

With both squares, all 8 di. for ABC may be computed, sets I and III 
being taken from the experimental data for square II, and sets II and I\ 
from the data for square I. 

factorial in a 9 X 9 square. Each square in plan 8.(5 provides 1/2 
relative information on ABC^ ABD, ACD^ and BCD, For an experiment 
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with a single replication, the simplest subdivision of the degrees of free- 

dom is : 


di. 


Rows 

8 


Columns 

8 


Main effects 

8 


IVo-factor iiiteractioas 

.24 ■ 


Error (estimated from high-order interactions) 

32 


When 2 replicatiom are available, both squares should be used. All 
S di. for any three-faetor interaction may be tested, though each is based 
on only a single replication. The calculation of the sums of squares for 
the three-faetor interactions follows the method outlined for the 3'^ fac- 
torial in blocks of 9 units (section 6.32). Plan 8.6 indicates the replica- 
tion from which aii}^ component must be taken. Thus, if ABC is being 
calculated, we use the data from square 11 for ABC II and ABC IV and 
the data from square I for I and (7 III. 

4 X. 2^ factorial in an S X S square. Plan 8.7 requires 4 replications. 
The symbols A', A", refer to components of the A effects. 

A^ == Ua H- (12 — Ui — Goi A^^ == ug — ^2 Ui -P uoj 

A'" - as — a2 + cii — uq 

No two-factor interaction is confounded. 

In view of the small relative information on ABC (only 1/3 on the 
avex’age), the sum of squares for this term v'ill usually be combined with 
the error. In this event the analysis of variance reads avS follows. 

■ d.f. 


Rows 7 

Columns 7 

Main effects and two-factor interactions 12 

Error (by subtraction) 37 


BA Other QuasiAatin Squares 

Yates (8 A) gives experimental plans which can be used for the follow- 
ing factorial combinations: 3^ X 2 (6 X 6 square), 4X2'^ (8X8 
square), and 9^ (9 X 9 square). Analogous to the last design are designs 
for 7^ (7 X 7 square) and 5^ (5 X 5 square). In a general discussion of 
methods for constructing such plans, Rao (8.3) presents designs for a 2^ 
factorial in 4 X 4 squares. 

For certain factorial combinations of treatments, the lattice squares 
(chapter 12) and the incomplete latin squares (chapter 13) confound all 
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treatment effects to tlie same extent. These arrangennents may be use- 
ful in cases where the experimenter does not wish to sacrifice information 
on the interactions. 

8.5 Estimation of the Efficiency of Quasi-latin Squares 

In many cases the precision of a qiiasi-latiix sqiiai’e can be Compared 
with that of some other design which might have been used. The data 
from the example in table 8.1 will illustrate two types of comparison. 

In this experiment a 2^ factorial system applied to plots of soybeans 
was arranged in two 4 X 4 squares, with 1/2 relative iiifoiiimtim^ on 
AB, AC, BC\ nnd ABC, As an alternative, the experiment might have 
been laid out in randomized incomplete blocks of 4 plots each, witii 
complete confounding of ABC (plan 6.1). Since the columns were more 
compact in shape than the rows, we will suppose that the columns would 
have been chosen as blocks. 

We first estimate the error m.s. that would have been obtained with 
incomplete blocks. It is helpful to construct analyses of variaiiee for the 
2 designs (table 8.3) for the case where there are no treatments. The 

TABLE 8.3 Analyses of variance (applicable to uniformity data) 


Quasi-lat in square Incomplete blocks 



d.f. 

m.s. 


d.f. 

m.s. 

Columnjs 

6 

IS. 78 

Blocks ((joluimis) 

6 

18.78 

Rows 

6 

10,82 

Intra-blocks 

24 

(11.15) 

Iiifcra-row-and-€olunin 

IS 

9.20 





mean squares to be inserted in the table are explained in the next para- 
graph. 

With incomplete blocks, the degrees of freedom to be divided between 
treatments and error are the 24 d.f. within columns (3 from each of the 
8 columns). Since the randomization ascribes to treatments a random 
selection from these 24 d.f., the mean square for the 24 d.f. provides an 
unbiased estimate of the error appropriate to the incomplete block de- 
sign. The 24 d.f. subdivide into 6 representing variations among rows, 
and 18 representing intra-row-and-column variation. If the interactions 
partially confounded with the rows are regarded as negligible, the rows 
m.s. (16.82) from the quasi-latin square analysis provides an estimate of 
tlie 6 d.f. The quasi-latin square analysis does not supply information 
on. the whole of the remaining IS d.f., since in the experiment 7 of these 
were ascribed to treatments. By virtue of the randomization for the 
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(j[uasi-latiii. square, however, the error m,s. (9,2{)), 'vith 11 d.f., may be 
taken as an estimate of the mean square for the whole 18 d.f. 

Accordingly, an unbiased estimate of the erroi* m.s. for the iiicomplein 
bloclv design is 

18 X 9.26 + 6 X 16.82 


24 


11.16 


For the estimation of main effects, the efficiency of the qixasblatin 
square relative to incomplete blocks is estimated as 11. 15/9. 2G, or 120%. 
Owing to the confounding in the quasidatin square, the relative efficiency 
on the interactions is onij^ 60%. 

The quasidatin square may also be compared with ordinary ran- 
domized blocks of 8 plots, where each replication is a pair of columns. 
Ill table 8.1 the column totals are as follows. 


Rep. i 

240.0 245.9 


Rep. 11 
250-4 233.1 


Rep. Hi 
255.8 230.4 


Rep. IV 
248.7 257.8 


With uniformity data, the randomized blocks error contains 28 d.f, 
(7 from each replicate). To obtain this error, the incomplete blocks 
error (24 d.f.) must be pooled with the 4 d.f. which measure differences 
l^ietween the 2 columns in each pair. Since the differences between the 
2 column totals in each replication are 0.7, 17.3, 19,4, and 9.1, respec- 
tively, the sum of squares for the 4 d.f. is 


(0.7)- + (17.3)'' + (19.4)- + (9.1)- 


94.87 


This gives [24(11.15) + 94.87]/28 = 12.95 as an estimate of the random- 
ized blocks error. The relative efficiency of the qiiasi-latin square is 
140% on main effects and 70% on interactions. 

The comparisons above require the assumption that treatment effects 
which are confounded w'ith rows or columns are non-existent. By more 
complicated procedures, estimates can be made which a.void those as- 
sumptions. The relative efficiencies quoted above are slightly too high, 
because we neglected the effect of the additional error d.f. that would 
be available in the alternative designs (section 2.31). 


8.6 Treatments Applied to Complete Rows of a Latin Square 

. A simple example (taken from plan 8.8) is shown below. The latin 
square treatments form a 2 X 2 factorial for factors B and (7. An extra 
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factor A has been superimposed on the rows. The first 2 rows receive 
the lower level of A, and tiie last 2 rows the higher level. 

ABC ■ 

(~) h c ( 1 ) Im 

H c h he (1) 

« (1) be h e 

a he (1) c h 

This plan might be tried in a case where the extra factor was not suit- 
able for «application to individual units. No precise estimate of the ef- 
fect of d. would be expected, but some useful information on the inter- 
actions between A and the other factors might be hoped for. However, 
this device has one property that necessitates caution in its use. When- | 

ever an extra factor is superimposed on the rows, certain interactions f 

between that factor and the latin square treatments are automatically J 

confounded with columns, as noted by Yates (8.2). In the plan above, i 

the interaction i 



{aim) + (a) + (6) A {c) — (1) 


I 


is the difference between the first 2 columns and the last 2 columns. On | 

the other hand, AB and AC are free of column effects. In the plans pre- i 

sented here, wffiich are due to Yates (8.1), such confounding is confined | 

as far as possible to high-order interactions. The reader wlio cuiistructs 
liis own designs should verif}^ before using them, wdiat interactions have i 

been confounded in this way. i 

As indicated above, these designs are useful in experiments where (i) f 

it is not easy to apply factor A to individual units, and (ii) aiunxrate ■ 

estimates of the interactions of A , but mh of its main effects, are wanted. ^ 

For instance, in an experiment on an irrigated crop, we might Avish to j 

find out how the responses to plant nutrients B and C are affected by the 
level of the water supply. Fac^tor A could then represent a restricted | 

and an abimdant amount of irrigation water, applied to whole rows of the j 

square. i 

These halpplaid squares are related Ixoth to quasi-latin squares and to ' I 

split-plot designs. The difference from quasi-latin sejuares is that in the | 

latter the confounding is confined to interactions, whereas main effects 5 

also are confotinded in half-plaid squares. The connection with split- j 

plot designs is seen when the rows are regarded as the experimental , i 

units, and the units (or plots) as sub-units. From this point of view the 
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plan above be considered as a split-plot design Avith the sub-units 
arranged in latin squares. 

A moderately large number of useful half-plaid squares can be con- 
structed; only five of the simplest are given here. In the notation em- 
ployed, the factorial system for the latin square treatments is placed in 
parentheses ( ), while the number of treatments imposed on the rows 
is shomx before the parentheses. Thus plan 8.8 shows a 2 X (2^) fac- 
torial. 

As with an ordinary latin square, rows and columns must both be per- 
muted at random. The whole-row treatments cannot be arranged in 
separate blocks. 

The statistical analysis may be followed from the partition of the de- 
grees of freedom, which is given in condensed form with each plan. As 
in the split-plot design, there are two errors. The error for the factor A. 
applied to the rows is derived from the sum of squares among rows minus 
the sum of squai^es for A. 

Most of the other treatment effects are unconfounded. Their sums of 
squares are calculated in the usual way and tested against the latin 
square error. In plans 8.9 and 8, 10 certain interactions are parfdally con- 
foxinded with columns; the calculations for these effects are described 
in the notes below. In the remainder of the plans, some interactions are 
completely confounded with rows or columns and consequently do not 
appear in the treatments s.s. The error s.s. is foimd by subtraction. 

8.61 Notes on the Plans and Statistical Analysis. £ X factorial 
m aJf X 4 square (plan 8.8) . At least two squares (4 replications) should 
be used. 

2 X X .^) factorial in a6 X 6 square (plan 8.9). Each square con- 
tains 3 replicates. BC and ABC are partially confoimded with columns, 
the relative infoi'mation being 8/9 and 5/9, respectively. The sums of 
squares for these effects are calculated by the procedure for the 3 X 2^ 
factorial in incomplete blocks of 6 units, where the columns constitute 
the blocks. A numerical example is given by Yates (8.1, p. 58). Col- 
umns la • • ’ III6 are identical, respectively, with Yates’ blocks la • * • 
III5. Notice that the factor applied to the rows is C. 

S X (S X factorial in a6 X 6 square (plan 8.10). Each square re- 
quires 2 replicates, while the 2 squares comprize a balanced set. Col- 
umn effects are partially confounded with AB and ABC, the UA^erage 
relative information being 7/8 and 5/8, respectively. If the columns are 
regarded as incomplete blocks, the numerical example in section 6.19 
may be followed for the computation of the sums of squares for AB and 
ABC. 
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The statistical amdyses for ^ 2 X (2'^) factorial (plan 8.11) and the 

2 X (2^^) factorial (plan 8,12) in 8X8 squares are straightforward. In 
the latter, the BCDE and ^4 BCD® interactions have been combined 
with the error for x4. 

Other designs, Yates (8.1, p. 79) presents the plans for a 4 X (2*^) fac- 
torial in an 8 X 8 square and (8.1, p. 80) for a 3 X (3'^) factorial in a 
9 X 9 square. Plans can also be constructed for a 2 X (2‘'') system in a 
8 X 8 square and for a 3 X (3^) system in a 9 X 9 square. 

8.7 Treatments Applied m Complete Rows and Columns of a Latin 

Square 

In these designs one set of treatments A is applied to complete rows 
of the latin square and another set B to complete columns. Altliough 
relatively low precision is obtained on tlie row and column treatments, 
the latin square treatments and their more important interactions with 
the A and B treatments are subject only to the latin square error. The 
number of useful designs appears to be limited. 

Tw'o examples are shown in plans 8.13 and 8.14. The latin square 
treatments form a 2'^ and 2^ system, respectively; the additional treat- 
ments are both at 2 levels. No two-factor interactions are confounded. 
The rows and columns of each square should be completely randomized. 

The subdivision of degrees of freedom is indicated on each plan. In- 
teractions which ai’e confounded with rows and columns must be omitted 
from the treatments s.s.: all other treatment effects are calculated in the 
usual way. In plan 8.14, where the square contains a single replication, 
the only available estimate of error comes from the high-order inter- 
actions, unless two or more squares arc used. 

A 9X9 sc(uare for a 3“ sysiem of treatments, with row and eoliimii 
treatments also at 3 levels each, is given in reference (8.1), p. 81. 
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ABC cc)iup](4(3ly confounded with squares 
S(|uare 1 Squaie II 


(1) be ac ah 

ac ab (1) be 

he (1) ah ac 

ab ac he (1) 


h a aha c 

ahe c . a h 

a h a aha 

c aha b a 


Plan SJ 


2^ factorial in an 8 X 8 quasi-latin square 
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CHAPTER 9 

BALANCED AND PARTIALLY BALANCED INCOMPLETE 

BLOCK DESIGNS I 

9.1 Balanced Designs [i 

As their name implies, these designs, introduced by Yates (9.1), are :| 

arranged in blocks or groups that are smaller than a complete replica- ! J 

tion, in order to eliminate heterogeneity to a greater extent than is pos- 
sible with randomized blocks and latiii squares. In the designs de- '.I 

scribed in chapters 6 to 8, this reduction in the size of block was acliieved 
by sacrificing all or part of the information on certain treatment com- :i| 

parisons. The present designs, on the other hand, were developed for 
experiments in plant breeding and selection, where it is desired to make 
ail comparisons among pairs of treatments with equal precision. Con- j 

seqiiently, a different method for reducing the size of block is employed, J 

The designs may be arranged either in randomized incomplete blocks 
or in quasi-latin squares. They may be balmwed or partially balanced, , 

The balanced designs will be illustrated first by simple examples of the 
experimental plans. 

Consider the plan in table 9.1 which compares 9 treatments in in- 
complete blocks of 3 experimental units with 4 repli(!ations. 

Every pair of treatments will be foimd to occur once, and only once, 
in the same block. For instance, treatment 1 occupies the same Idock 

TABLE 9.1 Balanckd design for 9 treatments in blocks of 3 units 

Bl<X!k Rep. I Rep. II Rep. ill Rep. IV 

(1) I 2 (4) (7) Txx (10) rxo 

: (2) 4 5 6 T5)''. 2^' 

with treatments 2 and 3 in the first replication, with treatments 4 and 7 
in the second replication, with treatments 6 and 8 in the third replica- j 

tion, and with treatments 5 and 9 in the fourth I’eplication. When the | 

results are analyzed by the method of least squares, this property, to \ 
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which the adjective balanced is applied, ensures that all pairs of treat- 
ments are compared with approximately the same precision, even though 
the differences among blocks may be large. This design belongs to the 
group known as balanced lattices, so-called because the plans are con- 
veniently written down by drawing a square lattice, with the treatment 
numbers at the intersections of the lines. In the balanced lattices, the 
number of treatments must be an exact square while the number of units 
per block is the corresponding square root. 

Balanced designs can be constructed for other numbers of treatments 
and of units per block. The plan in table 9.2 shows 7 treatments ar- 
ranged in blocks of 3 units. 


TABLE 9.2 Balanced design roa 7 anBATMENTS in blocks op 3 units 
Block 


(1) 1 2 4 

(3) 3 4 6 

(5) 

1 5 6 

(7) 1 3 7 

(2) 2 3 5 

(4) 4 5 7 

(6) 

2 6 7 



Again every pair of treatments occurs once within some block. In 
this case, however, the blocks cannot be grouped in separate replications, 
since 7 is not divisible by 3. Designs of this type are known as balanced 
incomplete blocks. 

For certain numbers of treatments and units per block, both the types 
above can be laid out in a kind of latin square formation so as to alloAv 
the elimination of variation arising from two types of grouping. The 
appropriate rearrangement for the first example is shovm in table 9.3. 

TABLE 9.3 Balancbo design fob 9 teeatmentvS in 4 lattice squabes 

Rep. I Rep. 11 Rep. Ill Bep. IV 
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Rows (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) (12) 
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3 


It may lie verified by inspection that every pair of treatments now 
occurs once in the same row and also once in the same column. All 
comparisons between pairs of treatments are of nearly equal precision. 
This design is known as a laiiice square. 

The second example is rewritten somewhat differently. 

Every treatment now appears in each of the 3 rows, and every pair of 
treatments appears together once in the same column. Since the plan 
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TABLE 9.4 Balanced design for 7 treatments in an incomplete 
LATIN square 


(1) 

Columns (Blocks) 
(2) (3) (4) (5) 

(ft) 

(7) 

Rows 
(1) 1. 

2 . 

3 4 

5 

6 

7 

(2) 2 

3 

4 5 

6 

:r 

1 

(3) 4 

5 

6 7 

1 

2 

3 


above could represent the first 3 rows of a 7 X 7 latin square, this type 
of design has been called an incomplete latin square, or alternatively a 
Youden square, after W. J. Youden, who developed these designs for 
greenhouse experiments. 

9.2 Partially Balanced Designs 

Although a balanced design can be constructed with any number of 
treatments and any number of units per block, the minimum number of 
replications is fixed by these 2 variables. In most cases this number is 
too large for the usual conditions of experimentation. In order to allow 
more freedom of choice in the number of replicates, designs which lack 
the complete symmetry of the balanced designs must be used. 

The most useful of such designs are the lattices. The.se are constructed 
in the same way as balanced lattices except that them are fewm jeplica'- 
tions. The design with 2 replications (e.g., the first 2 replications in 
table 9.1) is called a simple lattice, and that with 3 replications a trj,plc 
lattice. Bimilarly, with a lattice square, as in table 9.3, we may use les.s 
than the full number of replicates neces.sary for balance. For all these 
designs the number of treatments must be a peifect square. 

A further set, called aihic lattices, is useful when the number of treat- 
ments is very large. The number of treatments is the cube of the num- 
ber of units per block, .so that a drastic reduction m block size is obtained. 
The number of replicates is 3 or some multiple of 3. 

Partially balanced designs are less suitable than balanced designs in a 
number of ways. The statistical analysis is more complicated. It is 
mainly for this reason that we have not included any partially balanced 
designs corresponding to those in tables 9.2 and 9.4. imther, when the 
variation among blocks (or rows and columns) is large, some pairs of 
treatments are more precisely compared than others, so that several 
standard errors are reqtiired for tests of significance. Also, the overall 
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precision of the experiment is decreased by the lack of symmetry. This 
means that under comparable conditions a partially balanced design 
gives a slightly higher standard error per replication than a balanced de- 
sign. These disadvantages are of minor importance in lattice designs, 
which will be found almost as convenient as balanced designs. 

9,3 Basis of the Statistical Analysis 

9.31 AnaiyvSis without Recovery of Inter-block Information. To those 
who have previously used only the simplest designs, one feature of the 
incomplete block designs will be unfamiliar. The treatment averages 
must be adjusted in order to secure the full accuracy available in the de- 
sign. That this is necessary can be seen by examining, for example, the 
balanced lattice in table 9.1. In three of the 4 replications, treatments 
1 and 2 are in different blocks. Consequently, if the simple averages of 
the 2 treatments are compared, the effects of differences among blocks are 
eliminated only for the first replication. 

Yates developed two methods of analysis for the designs. The first 
is an analysis by standard least squares, reference (9.1), sometimes re*- 
ferred to as the -'intra-block^^ analysis. Later he showed that com- 
parisons among the block totals also contained information about treat- 
ment effects tliat can be utilized in the larger experiments. This ap- 
proach is analogous to that in split-plot experiments, where it will be re- 
called that use is made both of comparisons within whole-units (corre- 
sponding to incomplete blocks), and of comparisons among the totals of 
different whole-units. In this section the original method of analysis 
will be outlined for balanced lattice experiments. This method has not 
been entirely superseded by the newer anal^^sis, because it must be used 
with small experiments. The newer analysis is sketched in section 9.32, 

For the original analysis the mathematical model and the assumptions 
are essentially the same as those for previous designs in this book, as 
described in section 3.2. In the balanced lattice there are treatments 
in blocks of h units, with {k + 1) replications. Let yijq be the observa- 
tion for the qtk treatment, which we suppose to be in the jth block within 
the fth replication. The model is 

Vm = M + + rq + eijq (9.1) 

where pi, tt^*, and Tq represent the effects of the mean, the replicate, 
the incomplete block, and the treatment, respectively, and e-ijq is the 
intra-block residual or error, assumed to be normally and independently 
distributed with mean zero and variance As with previous designs, 
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we find estimates of the parameters by minimizing 
(yijq Pi ^ij iq)^ 

subject to the usual relations 

Z) Pi = 0j Y^ii = 0 (for any i), Y = 0 (9-2) 

■i' 3 . q 

The normal equation for any unknown is obtained as before by equat- 
ing the observed total to the expected total over all units whose equation 
contains the constant. Tluispif denotes the total for ail (/€ + 1) 
units that receive the qth treatment, the normal equation for is 

Tq- = (/ij + l)m + + (h + l)tq (9.3) 

the terms in the having disappeared in virtue of aquations (9.2), since 
any treatment appears in afl replicates. 

This equation does not at once give the value of tq^ since it still contains 
the imknoAvns m and From the normal equation for m, it is easy to 
see that m is the average over the whole experiment. The normal equa- 
tion for any bij is 

Bij =: M + fcjh* + + Htq (9.4) 

where Bij is the observed block total, and the treatments sum is over the 
k treatments that are in the block. If we add these ecpuitions for all 
blocks that contain the qth treatment, we obtain 

Bt = kik + l)m + /cE hj + Y E (9-5) 

where Bt is the total of all such blocks. 

It is at this point that the structure of the design is important. In 
the blocks that contain tq^ every other treatment occurs once and only 
once, while tq appears {k + 1) times. Hence 

13 X) + X == + X ^ 

from (9.2). 

This simplifies (9.5) to 

Bt - k(k + l)m + hj + Mq (9.6) 

We may now eliminate the unwanted term XI (^*3) and (9.6), 

giving 

kTq - Bt = k% (9.7) 

a relatively simple solution. In effect, we simply adjust the treatment 
total for the total of the blocks in which it appears. 
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Since tq represents the cfeuiafzon of the estimated treatment mean from 
the mean for all treatments, it is customary in practice to compute 
{tq + m), which represents the estimated mean itself. A little algebraic 
manipulation from (9.7) shows that 


tq + fn 


Tq , ^ + 1)^^ + G] 


k + 1 


+ 


k%k+ 1 ) 


(9.8) 


where G — k^(k + l)m is the grand total for the whole experiment. 
Note that Tg/{k + 1) is the simple or unadjusted treatment mean. The 
last term on the right therefore represents the adjustment that is applied 
to the ordinary mean. This result will be used in a later comparison with 
the newer analysis. 

A continuation of the analysis shows that the variance of the diif erence 
between two tq values is 2ae^/k, If the experiment had been in ran- 
domized blocks, with the same number of replications, {k + 1), the cor- 
responding variance would have been 20 - + 1), where cr^ m the error 
variance for randomized blocks. Hence the new design, with this method 
of analysis, gives a more accurate experiment than randomized blocks if 
and only if 


< 


k 


k+1 

The quantity k/{k + 1) is called the efficiency factor of the design. Note 
that the incomplete block design would be less accurate than randomized 
blocks if the variation among incomplete blocks were as great as that 
within complete replications. With the newer method of analysis this 
disadvantage Is largely removed. 

9.32 Analysis with Recovery of Inter-block Information. This analysis 
rests on more difficult theory and will be outlined only in part. Formally, 
the mathematical model is exactly the same: 


Vijq = + Tti + Tq + 


{9.9) 


However, the additional assumptions are made that the block effects 
§ij are normally and independently distributed with zero means and 
variance and that they are independent of the etjq. It follows from 
the assiunptions that for the difference between two observations in the 
same block, the residual variance is since the %’s cancel, whereas 
for the difference between two observations in different blocks, the 
residual variance is 2 ((t/ + The reader will note the analogy with 
split-plot experiments (section 7.12). With regard to the justification 



I 

I: 


for the additional assumptions, it can be said that in agricultural field ex- 
periments, for which the designs were fiirst developed, it is usually just 
as reasonable to make the assumptions for the % as for the eij^. The 
assumptions cannot be taken for granted, and with certain types of data 
we might not wish to make them. 

As a result of the assumptions, observations that are in the same block 
are positively correlated, so that the simplest type of least squares 
e>stimatioii cannot be applied. Instead, we may write down the joint 
fi^equency distribution of all the observations, and use the more general 
method of estimation known as maximum likelihood. By a well-known 
devif3e in theory, maximum likelihood estimation may be shown to be 
equivalent to the minimization of a weighted sum of squares, consisting 
of two parts. The first part is the sum of squares of deviatious of the 
residuals from their block means; the second is the sum of squares of the 
residuals of the block totals. The two parts receive weights w and w'/k^ 
respectively, where 





For the moment we suppose that w and w' are knowm. For the balanced 
lattice, the quantity to be minimized is 

■ : /C . ■ . ■ , ■ 

(9.10) 

where yij.Mj, denote, respectively, the observed mean of a block and the 
mean of the f s that occur in the block. The first sum is over all observa- 
tions, the second over all blocks. 

When we differentiate with respect to a given tq, we must note that the 
term iij, will contain *with coefficient I/fc, whenever the block in ques- 
tion contains the ^th treatment. The deiivative, when e(]uated to zero 
(omitting a factor —2), gives 

. ... w' ■ ' 

2 Vir ^(Z + ^^7-] + Iw 7~ [-^*7 “““ ^ 

‘ (9.M) 


The first sum is over the observations that receive the ^/th treatment; 
the second over the blocks that contain this treatment. The derivative 
contains an additional part arising from the terms in kj. in the first 
bracket of (9.10), but this contribution will be found to vanish. 

The next step is to simplify (9.11). Both brackets involve a sum of the 
terms • In each case, because of the symmetry of the balanced lattice, 


260 INCOMPLETE BLOCK DESIGNS 

the sum contains all other t’s once, and tq (k 1) times. Hence 
(fc + l)tq + (rest of t’s) kt„ 


9.32 




k 




since the total of all t’s is zero. Further, we maj'- note that the observed 
block mean y;j. equals Bij/k. Writing as before Tq for the treatment 
total and Bt for the total of all blocks in which the treatment appears, we 
have from (9.11) 


w 


Ta 


k 


(k + 1)^4 + tq 


W 

+ T‘^‘ 


or 


{kw + io')tq + {k + = 'wTq ~ (:w - w^) 


k{k + l)??^ — ktq] =« 0 

(9.12) 
Bt 


k 


(9.13) 


The constant m may be sliown to be the mean for the whole experiment, 
or G/k^{k + 1). After some rearrangement we may express (9.. 1 3) in a 
form comparable with the original estimate given in (9.8), as follo^vs. 


tq + m === 


Ta 


k + 1 




(to -- w')[kTq ^ {k + \)Bt + G] 
k(Jc + 1) (A)ie~ + w') 


(9.14) 


The adjustments are seen to be of the same form in the new as in the 
original analysis. The two are equal when is zero. This occurs when 
{Xb^ is very large; that is, when differences among blocks are great. In 
all other cases the adjustments are smaller with the new than with the 
original analysis. They reduce to zero when ty' = w, which happens 
only if (Tb^ is zero, or in other woi'ds if there are no real differences among 
blocks. Thus the new analysis is a generalization of the original anal- 
ysis, reducing to it in the extreme case where there are marked variations 
among incomplete blocks. At the other extreme, when the arrangement 
into blocks has been ineffective, the new analysis makes no adjustments 
for the non-existent block dift*erences, and in fact reduces to an analysis 
by the method for randomized blocks. 

In practice the weights w and are not known. Yates has shown 
that they can be estimated from the analysis of variance, in the same v- ay 
as we estimate two separate errors in a split-plot experiment. The de- 
tails will not be given. The fact that weights are estimated rather than 
exact introduces some additional sampling variation into the adjust- 
ments. This is unimportant in the larger experiments, but in certain 
of the smaller experiments the weights cannot be estimated accurately 
and the original analysis is recommended. For other accounts of the 
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anatysis by different approaches, see references (9.2), (9.3), (9.4), and 
(9.5)., 

9.4 Comparison of Incomplete Block and Randomized Block Designs 

So far as the experimental operations are concerned, incomplete block 
designs are no more difficult than randomized blocks. Some extra 
planning is involved in drawing up and randomizing the experimental 
plan, especially if care is taken to make the best possible grouping of the 
experimental units. According to the computing equipment and ex- 
perience available, the time required for the statistical analysis may ex- 
ceed that for randomized blocks by 20 to 150%. 

The gain in accuracy over randomized blocks depends on the type of 
experimental material and may be expected to increase as the number of 
treatments is increased. Most of the incomplete block designs cover the 
range from 6 to 200 treatments, while the cubic lattices extend this range 
to 1000 treatments without requiring a large incomplete block. The 
number of treatments for which a substantial increase in accuracy is at- 
tained must be determined by experience. If the experimental material 
is highly variable and yet lends itself to the formation of small groups 
which are homogeneous, the designs may be advantageous even with 
small numbers of treatments. From the results of varietal trials a num- 
ber of comparisons with randomized blocks have been made: see, for ex- 
ample, references (9.6), (9,7), and (9.8). The experiments varied in size 
from the 3 X 3 to the 11 X 11, and included both lattices and lattice 
squares. They indicated an average gain in accuracy of the order of 
25%. This means that 4 replications of an incomplete block design were 
about as accurate as 5 replications of randomized blockvS. 

The ease with which the number of replications can be increased is 
also a factor. The object in the new designs is to obtain the most ac- 
curate comparisons that are possible from a given number of experi- 
mental units. Accordingly, the designs are likely to be most helpful when 
the amount of experimental material or considerations of cost and labor 
force the experiment to be smaller than is desired. Where the number of 
replicates can be increased without difficulty, the experimenter may pre- 
fer some extra replication of a simpler design which avoids the caliaila- 
tion of adjustments. 

There is one important property, possessed by many of the designs, 
which increases their attractiveness relative to randomized blocks. As 
the plans show, the lattice square and the lattice designs are arranged 
in complete replications as well as in incomplete blocks, A few of the 
balanced incomplete block designs can also be grouped into replications. 
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Such designs may be regarded as randomised block designs which have 
additional restrictions within each replication. It has been shown by 
Yates (9.2) that them designs can be ayialyzed as if they were ordinary ran- 
domized blocks. This implies that the unadjusted treatment means give 
unbiased estimates of the true treatment effects, and that the F- and t- 
tests do not lose their validity. Of course, this analysis will in general 
be less accurate than the complete analysis. 

Thus, if there is any criterion for forming incomplete blocks, an in- 
complete block design is worth a trial in preference to a randomized 
block design which occupies the same set of replications. When the data 
have been collected, the experimenter may choose whether to analyze 
them as randomized blocks or to complete the full analysis, with the 
adjustments for incomplete block variations. In fact, if the variation 
among incomplete blocks is no greater than that within blocks, the com- 
plete statistical analysis reduces automatically to that for randomized 
blocks, as mentioned in section 9.32. 

The analysis as randomized blocks may be useful in experiments where 
several measurements are made on each experimental unit. In the for- 
mation of incomplete blocks the imits are usually grouped with regard 
to the most important measurement. For certain other measurements 
from the same experiment, the grouping may be less effective; in such 
cases the ^ Randomized blocks^^ analysis will be satisfactory, as also with 
measurements of subsidiary interest, where the greatest attainable ac- 
curacy is not required. 

In some types of research an appreciable number of units are likely to 
be injured or destroyed in the course of the experiment, so that they must 
be omitted from the statistical analysis. With incomplete data (except 
whei-e only a few units are missing) laborious computations are required 
to calculate the block (or row and column) adjustments. Consequently, 
in experiments where missing data are of frequent occurrence, incom- 
plete block designs cannot be used to full advantage. Even in this case 
nothing is lost by using an incomplete block design wdiich can be ar- 
ranged in complete replicates. If when the experiment is completed it 
becomes evident that an appreciable number of units must be discarded, 
the experimenter may use the randomized blocks analysis in which the 
extra complication due to missing data is smaller. The same con- 
siderations apply in cases where certain treatments may have to be 
ignored in the final analysis. 

9.5 Comparisons with Other Designs 

Yates (9.1) has discussed three other designs that have been used 
when the number of treatments is large. 
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9.51 Systematic Controls. In agricultural field experimentSj a control 
variety is sometimes placed a, t regular intervals over the site^ the ex- 
perimeirtal varieties being arranged in complete blocks. From the yields 
of the control plots a fertility index may be calculated for every plot in 
the experiment. These indices are used to adjust the experimental plot 
yields for local variations in fertility. 

The method of systematic controls is very flexible, since it can be used 
with any number of treatments and any number of replicates. Little 
evidence is available about the increase in accuracy obtained from the 
controls, though it seems probable that the increase is seldom large if the 
extra space occupied by the controls is taken into account. The calcula- 
tion of the best adjustments, as described by: Yates (9,1), is rather tedi- 
ous, but if a crude type of adjustment is made most of the potential ad- 
vantages of the method may be lost. 

There may be additional reasons for the presence of extra controls, 
e.g., for their use in an observational scoring of the experimental ma- 
terial In this connection it should be noted tliat ext.ra controls can be 
included in an incomplete block design. 

9.62 Random Controls/ In another method the treatments are di- 
vided into groups, the grouping remaining the same in all replicates. 
Each group is arranged in a separate randomized block or latin square 
experiment. In order to obtain comparisons between treatments that 
are in different groups, one or more controls are included in each group 
and randomized along with the other treatments. Before comparing 
treatments tha,t are in diffei'ent groups, we subtract from each treat- 
ment mean the mean of the controls that are in the ^same group. Thus 
the controls serve to correct for differences in the fertility of the sites on 
•which different groups are tested. 

This design allows considerable flexibility in number of treatments and 
amount of replication and is simple to analyze. From theoretical con- 
siderations it is likely to be inferior in accuracy to a comparable in- 
complete block design, if one exists. Moreover the error variance is not 
the same for all types of comparison. If only one control is included, 
the variance is twice as great for the difference between two treatments 
that are in different groups as for two treatments in the same group. 

9.63 Split-plot Designs. Tliis airangement is a variant of the previous 
design which avoids extra controls. As in section 9.52, the treatments are 
first divided into a number of groups of equal sizes. Instead of testing 
each group on a separate site, the groups are combined into a single ex- 
periment of the split-plot type. For example, with 25 treatments and 
5 replications, we might first divide the treatments into 5 groups, A, >S, 
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C, /), and IL The groups are now regarded as whole-plo1^te 
and could be arranged in a 5 X 5 latin square on plots 5 times as large as 
the basic plots. Within each group the treatments are arranged at ran- 
dom on the individual plots. 

From the analysis for a split-plot design (section 7,15), it will be seen 
that we can test the dilTerence between tvvo teatments whether they 
are in the same group (i.e., whole-plot) or in different groups. Controls 
are no longer needed. The advantages and disadvantages of this ar- 
rangement are in general similar to those of the 'b'andom controls^^ : the 
split-plot design is more accurate if the space that would have been al- 
lotted to the controls is utilized. Sometimes the split-plot design is 
really more appropriate than an incomplete block design because the 
treatments divide themselves naturally into groups, and comparisons 
between members of the same group are regarded as more important 
than comparisons between members of different groups. 

9.6 Choice of Incomplete Block Design 

Tables 9.5 (randomized incomplete blocks) and 9,6 (lattice squares 
and incomplete latin squares) form an index to the incomplete block de- 
signs in this book. Each table is arranged by number of treatments {t) 
and number of units per block (&) and shows the numbers of replications 
for which designs are available, with references to the plans. It is hoped 
that the tables provide a rapid means of locating a suitable design, if one 
has been constructed. 

Where the niunber of treatments and the size of block are fixed in ad- 
vance by the conditions of the experiment, little choice is available to 
the experimenter. In certain types of research, however, both the niun- 
ber of treatments and the size of block (or row and column) can be varied 
to some extent without impairing the experiment, 

Wlrere more than one design appears appropriate, a design which can 
be arranged in separate replicates is preferable to one which cannot, and 
a balanced design is preferable to a partially balanced design. These 
recommendations usually narrow the choice to one or two designs. For 
example, with 25 treatments to be compared in 8 replications, we might 
use (see table 9.5) either a balanced incomplete block design in blocks of 
4 units, or a lattice design (partially balanced) in blocks of 6 units. The 
latter design, though not the former, can be laid out in separate repli- 
cates. 

The relative advantages of lattice and lattice square designs can be 
learned only by experience. From the results of a lattice square experi- 
ment we may estimate what the standard error would have been if 



TABLE 9.5 Designs akkanged in bandomized incomplete blocks 


Num- J 
ber of , 
treat- 
ments, 
t 

Num- 
ber of 
units 
per 
block, 
k 

Number of 
replications, 
r * 

Plan 

number t 

Num- 
ber of 
treat- 
ments, 
t 

Num- 
ber of 
units 
per 
block, 
k 

Number of 
replications, 
r 

Plan 
number t 

p.b.t 

b.t 


p.b.t 

b.t 


4 ■ 

' ' 2 


3 

11.1 

16 

4 

0 . . , 

' " 5" ■ 

10.2 


3 


3 

11. 


6 


6 

11.27 

5 

2 


4' 

11,2 


6 


9 

11,28 


3 


6 

11. 


10 


10 

11 .29 


4 


4 

11. 

19 

3 


9 

11.30 

6 

2 


5 

11.3 


9 


9 

11.31 


3 


5 

11.4 


10 


10 

11.32 


3 


10 

11.5 

20 

4 

2,3 


10.11 


4 


10 

11.6 

21 

3 


10 

n.33 


5 


5 

11, 


5 


5 

11.34 

7 

2 


6 

11. 


7 


10 

11.35 


3 


3 

11.7 

25 

4 


8 

11.36 


4 


4 

i 11.8 


5. 

1 2..* 

6 

10.3 


6 


6 

i 11. 


i' ''9'. 


9 

11 .37 

8 

2 


7 

i 11.9 

27 

1 3 

3 


, 10. 


4 


7 

11.10 

28 

.- ..' 4. ' 


.. 9.'. 

11.38 


7 


.7 - 

11. ’ 


7 


, 9 

11.39 

9 ■ 1 

2 


8 

.11.. 

30 

5 

2,3 


10.12 


3 

1 2,3 

4 

10.1 

31 

6 


0 

11.40 


4 


■ s.''! 

11.11 


10 


10 

11.41 


5 

1 

10 ' 

11.12 

36 

6 

2,3 


10.7 


6 


r ^ B ! 

11.13 

37 

9 


9 

11.42 


8 


8 

. .11, 

41 

5 


10 

11.13 

10 

■2 ■ 


i. .. .'9 ' :i 

11;14 

42 

6 

2,3 


10.13 


'3 1 


9 

11.15 

49 

7 

2 ... 

8 

10.4 


■ ^ 


1 ; 6^'\ 

11.16 ' 

56 

7 

2,3 


10.14 


5 


i 9 

11.17 

67 

8 


8 

11.44 


. 6'.'l 


9 

11.18 

04 

4 

3 


.10. 


9 


: ■o:: ■ 

11, 

64 

8 

2 ••• 

9 

10.5 

11 

2 ' 


10 

n. 

72 

8 

2,3 


10,15 


5 


6 

11.10 

73 

9 


9 

11.45 


d 

■ ' ' ' j 

: ^ 6V.' 

11.20 

81 

9 

2 ••• 

ID 

10.6 


Id 


10 

11. 

90 

9 

2,3 


10.16 

12 

3 

2,3 


10.10 

91 

10 


10 

11.46 

13 

3 


6 

11.21 

100 

10 

2, 3 


10,8 


4 


4 

11.22 

121 

11 

2 •• 


10.(12,7) 


9 


9 

11,23 

125 

5 

3 


10, 

15 

3 


7 

11.24 

144 

12 

2, 3,4 


10,9 


7 


7 

11.25 

169 

13 

2 


10.(12.8) 


8 


8 

11.26 







* Or any multiple of this number, 
t p.b, = partially balanced designs. 

b. =« balanced designs. 

J References 10., 11., denote chapters. 
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TABLE 9.6 Designs abeangeb in lattice squares and incomplete 

LATIN SQUARES 


Kuril- 
ber of 
treat- 
ments, 
t 

Num- 
ber of 
units 
per 
col. 
and 
row 

Number of 
replications, 

Plan 

number 

Num- 
ber of 
treat- 
ments, 
t 

Num- 
ber of 
units 
per 
e.oL 
and 

row 

Number of 
replications, 

■fe 

Plan 

number 

, 

■■ 

p.b.f 

b.f 

p.b.t 

b.t 

3 

S, 5 

5 


13.10 

11 

11, 5 


5 

13.3 


3, 7 

7 


13.17 


11, 6 


6 

13.4 


3, 8 

8 


13.18 


11,10 


10 

13. J 


3, 10 

10 


13.19 

13 

13, 4 


4 . 

13.5 

4. 

4,3 


(3)§,6 

134 


13, 9 


9 

13.0 


4, 5 

5 


13.20 

15 

15, 7 


7 

13.7 


4,7 

7 


1 13.21 


15,8 


S 

13.8 


: 4,9 

9 


1 13.22 

16 

4,4 


5 

12.2 


5,4 


4 

13. t 


i 16, 6 


6 

13.9 


5, 6 

0 


13.23 


16, 10 


10 

13.10 


5,9 

<) 


13.24 

19 

i 19,9 


9 

13.11 

6 

6, 5 


5 

i3.t ! 


, 19, 10 ' 


10 

13.12 


0,7 

7 


1,3.25 

21 

21, 5 


5 

13.13 

7 

7,3 


3 

13.1 

25 

5,5 


3 

12.3 


7,4 ' 


4 

13.2 ; 

31 . 

: 31, 6 1 


G 

13.4 


7,6 , 


(> 

134 I 

37 

37, 9 


9 

13.15 


7,8 

8 


13.20 

49 

7,7 

3 

4 

12,4 

8 

7,S ' 


7 

13. 1 1 

64 

8,8 

3... 

9 

12.5 

9 

3,3 


(2)§,4 

12.1 ! 

81 

9,9 

3,4 

5 

12.6 

9 

ft 8 


8 

134. : 

121 

11,11 

3... 

0 

12.7 

10 

10, 0 : 


9 

v 

134 

109 

13, 13 

3... 

7 

12.8 


Or any multiple of this number, 
t p.b. « partially balanced designs, 
b. — balanced designs. 

t Constructed from at X t Latin square by omission of last column. 

§ ( ) Not enough degrees of freedom for error. 

either of the two types of grouping had not been used. Consequently the 
experimenter may test additional methods of grouping by using lattice 
squares. If the extra grouping turns out to be ineffective, the accuracy is 
only slightly less than that of the corresponding lattice design. The 
statistical computations are, however, more laborious for lattice squares, 
since correction terms must be obtained for both row and column effects. 
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In generalj the lattice squares may be expected to be successful in typos 
of experimentation where the iatin square has been found siiperior to 

randomized blocks. 
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CHAPTER 10 

LATTICE AND CUBIC LATTICE DESIGNS 
lO.i Balanced Lattices 

10.11 Description. The number of treatments must be an exact 
square. The size of block is the square root of this number. Incomplete 
blocks are combined in groups to form separate replications. The special 
feature of the balanced lattice, as distinguished from other lattices, is 
that every pair of treatments occurs once in the same incomplete block. 
Consequently, all pairs of treatments are compared with the same degree 
of precision. 

The numbers of replications are rather severely restricted, as well as 
the numbers of treatments. The useful plans are indexed below. Bal- 

BaLAN CEP LATTICES CONTAINED IN PLANS 10,1-10.6 

Number of treatments 9 16 25 49 64 81 

Units per block 3 4 5 7 8 9 

Replicates 4 5 6 8 9 10 

anced lattice designs cannot be constructed for 36 treatments, and none 
has been found for 100 or 144 treatments. 

10.12 Statistical Analysis. The analysis is comparatively easy. It 
will be illustrated by an experiment on the effects of 9 feeding treatments 
on the growth rates of pigs, conducted by the North Carolina Agricul- 
tural Experiment Station. The analysis has been described in more de- 
tail by Comstock et at (10.1), and the nutritional information obtained 
from the x^esults by Peterson ct at (10.2). 

For pigs of a given breed, previous experience indicated that a con- 
siderable part of the variance in growth rate between animals can be 
ascribed to the litter. Hence the experiment was planned so that litter 
differences would not contribute, to the intra-block error. The pigs were 
divided into sets of 3 litter-mates. Two sets of 3 were assigned to each 
block. Within a block, each treatment received one member of each set. 
Thus the experimental unit was composed of 2 pigs each feeding in the 
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same pen. The plan and growth rates are given in table 10.1; the plan 
has been rearranged so that it follows the same pattern as plan lO.l. 


TABLE 10.1 Gains in weight (pounds per day) for a total of 2 pigs 




Rep. 1 




Rep. II 



Block 




Totals 

Block 




Totals 

(1) 

(1) 

(2) 

(3) 


(4) 

(1) 

(4) 

(7) 



2.20 

1.84 

2.18 

6.22 


1.19 

1.20 

1.15 

3.54 

(2) 

(4) 

(6) 

(6) 


(5) 

(2) 

(5) 

(8) 



2.05 

0.85 

1.86 

4.76 


2.26 

1.07 

1.45 

4.78 

(3) 

(7) 

(8) 

(9) 


(6) 

(3) 

(6) 

(9) 



0.73 

1.60 

1.76 

4.09 


2.12 

2.03 

1.63 

5.78 





15.07 





14.10 



Rep. Ill 




Rep. IV 


(7) 

(1) 

(5) 

(9) 


(10) 

(1) 

(6) 

(8) 



1.81 

1.16 

1.11 

4.08 


1.77 

1.67 

1.43 

4.77 

(8) 

(2) 

(6) 

(7) 


(11) 

(2) 

(4) 

(9) 



1.76 

2.16 

1.80 

5.72 


1.50 

1.60 

* 

1.42 

4.52 

(9) 

(3) 

(4) 

(8) 


(12) 

(3) 

(6) 

(7) 



1.71 

1.57 

1.13 

4.41 


2.04 

0.93 

1.78 

4.75 





14.21 





14.04 




Treatment totals and adjustment factors 









Adjusted total iMean per 



T 

Bt 

IF = (32' - iBi + ff) 

T + ixW 


unit 

1 


6.97 

18.61 


+3.89 


7.21 


1.80 

2 


7.36 

21.24 


-5.4.6 


7.02 


1.76 

3 


8.05 

21,16 


-3.07 


7.86 


1.96 

4 


6.42 

17,23 


+7.76 


6.91 


1.73 

5 


4^01 

18.37 


-4,03 


3.76 


0.94 

0 


7.62 

21.03 


-3.84 


7.38 


1.84 

7 


5,46 

18.10 


+1.40 


5.55 


1.39 

8 


5.61 

18.06 


+2.05 


5.74 


1.44 

9 


5.92 

18.47 


+1.30 


6.00 


1.60 



57.42 

172.26 


0.00 


57.43 




The steps in the analysis are as follows. The algebraic formulae refer 
to a /c X ^ lattice in blocks of k units, with r = (fc + 1) replicates. 

L Calculate the block totals, the replication totals, the grand total G, 
and the treatment totals T, shown under the plan. 

2. For each treatment, calculate the total Bt for all blocks in which 
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the treatment appears, For treatment 4, this is 
- 4.76 + 3.54 + 4.41 + 

As a check, the total of the Bt values should be k times the total of the T 
values. 

3. Compute the quantities 
whose sum should be exactly jzero. 

4. The analysis of variance is now obtained. The total s.s. and the 
simis of squares for replications and treatments are found in the usual 
way. The sunt of squares for blocks within replicationSj adjusted for 
treatment effects, is 

(3.89)2 + (5.46)2 + . * . + (1.30)2 


k^{k + 1) 108 

6, Calculate the adjustment factor 

(B5-- B.) 0.1776 0.0^ 


1.4206 




0.0628 


k^Ei 9 X 0,1776 

where B& and Ec are the blocks and intra+lock m.s., respectively. The 
adjusted treatment total is {T + shown in table 10. 1. To avoid 

confusion, the adjusted means are shown per unit (total of 2 pigs) al- 
though a mean per pig would be more natural. If .£'?> is less than Be, ^ 
is taken as zero, and no adjustments are applied to tlie treatment totals. 

TABLE 10.2 Analysts of vaeiance for total growth rate of 2 pigs 

d.f. 


Replications 
Treatments 
Blocks (adj.) 
Intra-block error 

Total 


General 

h 

(&" - 1) 
ik? - 1 ) 

(fc - - 1) 

( i .3 ^ ^.2 _ 1 ) 


S.S. 

0.0774 

3.2261 

1.4206 

1.2308 

5.0609 


0.1776 

0.0773 


= JS^e(14-fcM) = 0.0773(1 +3 0.0919 

purpose of the adjustment factor is to increase Ee so as to take ac- 
count of sampling errors in the block correction values ^LW. The ordi- , 
nary rules for the calculation of )5-tests may now be applied to £/. Thus 


6. For i-tests, calculate the effective error m.s. 


MiSSiNO DATA 
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the varitiuce of the dii'ference l)etween two adjusted tr(3atmciit totals is 
2rEe\ while that for the dilTcu'enco between two adjiiKsied treatment 
means is 2E//')% or O.OdOO. Tlie .standard error of this difference is 0.214. 
If means per pig were taken, tlie standard error would l)e 0.107. 

7. The treatments in.s. in table 10.2 cannot be tested against the 
intra-block error lu.s., since the former contains some block effects. For 
an approximate F-test, calculate the sum of scpiares of deviations of the 
adjusted treatment totals, which comes to 12.(5771, This is divided by 
r or 4 to bring to a >single-unit basis, and by — 1) or 8 to obtain the 
mean square, 0.3902. This is tested against the effective error m.s., 
0.0919. The F-vaiio, 0.3902/0.0919, or 4.31, has 8 and 10 d.f. 

8. In order to estimate the precision relative to randomized blocks, 
pool the mean stpuires for blocks (adjusted) and the intra-block error. 
The result is 0.1107, with 24 d.f., and is an unbiased estimate of the error 
variance that would have been present if the experiment had been ar- 
ranged in randomized blocks. This figure is compared with the effective 
error m.s., 0.0919. The relative precision is 0.1107/0.0919, or 120% 
This means that 1 replications of the balanced lattice appear to have 
been slightly less accurate than 5 replicates of randomized blocks. 

These designs may be used in factorial experiments where it is desired 
to avoid any sacrifice of replication on the interactions relative to that 
on the main efl’eets. Some of the combinations of levels that may be 
tested in balanced lattices are: 3 X 3, S X 2, 4 X 4, 4 X 2 X 2, 2^ 5 X 
h, 7 X 7, 8 X 8, 4;^ 2^', 9 X 9, 3‘b In analyzing a factorial experiment, 
first carry through the procedure described in this section. The sum of 
squares for deviations of the adjusted treatment totals is then divided 
into tlie sums of squares for main effects and interactions in the usual 
way; a divisor r must be introduced in order to convert these sums of 
squares to a single-unit basis. The resulting mean squares are tested 
against the effective error m.s., Ee(l + ifc/i), which lias (k — l)(k^ — 1), 
or in this case, 16 d.f. 

10,13 Missing Data. Since ItMme designs are often used in large ex- 
periments, involving perhaps several hundred observations, it is not eas}'' 
to ensure that all the^ observations are accurately made. Even with care- 
ful management of the experiment, there is always a chance that mistakes 
or accidents ^vill affect a few of the observations. Consequently, missing 
data tend to be more common with lattices than with small experiments. 

Methods for the analysis of the results of lattice experiments with in- 
complete data have been developed by Cornish (10.3), As might be ex- 
pected, the computations are lengthy. They become simpler in two 
special cases. The first arises when the incomplete blocks are ineffective, 
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so that the analji^sis reduces to a “randomized blocks” analysis. In this 
event values are substituted for the missing observations by the formula 
which applies to randomized blocks. The second case oceurs at the other 
extreme when the variation among blocks is so large that the inter-block 
information is negligible. Here the correct procedure is to insert values 
for the missing observations by minimizing the intra-block error 
m.s.. 

Since the general procedure must reduce to these two special pro- 
cedures in the appropriate caseSj it might be anticipated, as Gornish’s 
solution shows, that both the “randomized blocks” and the “intra- 
block” estimates for the missing observations are required. Thus we 
have to find two estimates for each missing value. Similarly, two anal- 
yses of variance are necessary, one to obtain the correct value for the 
intra-block error m.s. and one for obtaining the mean square among 
blocks. 

In an attempt to reduce the amount of arithmetic, some investigation 
has been made of the consequences of using a single estimate and a single 
analysis of variance. For this purpose the “intra-block” estimate seems 
the better of the two. It gives the correct intra-block en'or m.s.; the 
block m.s. is in general slightly too high. It provides an excellent ap- 
proximation when the blocks are effective and is at its Avorst when blocks 
are ineffective ; that is, Avhen a “randomized blocks^^ analysis should have 
been used. In the latter case it still gives unbiased estiim of the 
treatment means, but they are not as accurate as the estimates obtained 
by the use of the “randomized blocks” formula. The chief defect of the 
“randomized blocks” formula is that Avhen block variation is large it 
tends to give an overestimate of the intra-block error m.s., and some- 
times the bias is substantial. For this reason the “randomized blocks” 
estimate is considered more hazardous for general use, despite its greater 
simplicity* 

The procedure that is given for the incomplete block designs in this 
and succeeding chapters is to insert values for the missing observations 
by means of the “intra-block” formula. Thereafter the analysis is con- 
ducted in the usual way for complete data, except that in the analysis 
of variance 1 d.f. is subtracted from the intra-block error for ea(*h miss- 
ing observation. This method, or the more accurate Cornish method, 
should be used whenever it is intended to recover inter-block information. 
If many obserA^ations are missing, or if it is evident on inspection that 
blocks are relatively ineffective, the experimenter may decide at the start 
to analyze the data as a randomized block experiment. In this case sub- 
stitutes for the missing values should of course be obtained by the 
formula for randomized blocks (section 4.25). 
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The ^hiitra-block^^ formula for the estimate of a missing observation 
in a k X k balan{*ed lattice with (& + I) replications is as follows. 

h^T + /# + 1)JS -ll+G- Ml, - hBt 

^ — ^ ^ ^ _ ■ „ (10.1) 

h{k-lf 

In this formula;, and denote as usual the totals for the treatment, 
block, and replication which contain the missing observation, while G is 
the grand total Also 

Th = total (over all replications) of all treatinents that api)car in 
the block which has the missing oI>serv ation. 

Bt “ total of all loiocks in which the tnuitineiib with the missing 
observation appears. 

Example. Suppose that the observation 2.20 for treatment 1. in rep- 
licate 1 of tabic 10.1 had been missing. Form, the bkxdc, replicate, treat- 
ment, and grand totals, just as in tlie ordinary analysis. It is helpful to 
insert an x for the missing observation and to incslude this x in all totals 
where it should appear. When the value of x has been found, it can then 
be inserted in all the appropriate places and the dat^ are ready for com- 
puting the analysis of variance. The list of treatment totals appears as 
shown below. 

T 


1 

4.77 +x 

2 

7.36 

3 

8.05 

4 

6.42 

5 

4.01 

0 

7.62 

7 

5.46 

8 

5.61 

0 

5.92 


55.22 +x 


The quantities needed are 

T - 4.77, B - 4.02, R - 12.87, G - 55.22 
Tb = 4.77 + 7.36 + 8.05 - 20.18 
Bt = 4,02 + 3.54 + 4.08 + 4.77 = 16.41 

Hence 

(9)(4.77) + (12)(4.02) - 12.87 + 55.22 -- (3)(!20.18) - (3)(16.41) 
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The analysis of variance is now computed in the usual way, except 
that the total degrees of freedom are reduced from 35 to 34 and the 
intra-block error degrees of freedom from 16 to 15. The same adjust- 
ment factor ju may be used (there is actually a slight change w4iich can 
be ignored). 

When several observations are missing, the values are inserted by the 
method of successive approximation (illustrated in section 4.25). The 
exact formulae for i-tests are complicated. The following approximate 
rule assigns an effective number of replicates to each of two treatments 
A and B whose means are being compared. In any replicate of the ex- 
periment, A is credited with 0 replication if it is absent; with 0 replica- 
tion if A is present but B occurs in the same block and is absent; and with 
1 replication otherwise. The same rule is applied to B. For instance, in 
a i-test of the difference between treatments (1) and (3) in table 10.1, 
treatment (1) is credited with 3 replications and treatment (3) with 3 
replications (it loses 1 replication because in the first replicate of the ex- 
periment treatment (1) is in the same block and is missing). On the 
other hand, in comparing treatment (1) with treatment (4), the effective 
numbers of replications are 3 and 4, respectively. 

10,2 Partially Balanced Lattices 

10.21 Simple Lattices (Two Replicates). For an experiment with 2 
replicates, use the first 2 replicates of the appropriate set showm below 
(plans lO.l-'lO.G). Notice that designs are available for 36, 100, and 
144 treatments (plans UI.T-IO-Q) as well as for those numbers of treat- 
ments for which balanced designs are given. Plans for 121 and 109 
treatments can be taken from plans 12.7 and 12.8. 

The asymmetry of the designs is apparent from the plans; thus, with 
9 treatments (plan 10,1) the first treatment appears in the same block 
as treatments (2), (3), (4), and (7), but not in the same block as any of 
the remaining treatments. 

For 9 and 16 treatments, simple lattices are unlikely to be more ac- 
curate than randomized blocks unless the variation among incomplete 
blocks is great compared with that within iiuiomplete blocks. Further , 
the numbers of degrees of freedom for estimating the error are only 4 and 
9, as against 9 and 16, respectively, for randomized blocks. 

10.22 Triple Lattices (Three Replicates). The first 3 replicates of 
plans 10.1--10.9, 12.7, and 12.8 are used. Designs may be obtained for 
all squares from 9 to 169. With 9 treatments the precaution mentioned 
in section 10,21 applies also to the triple lattice. 
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10.23 Four Replicates. These cle, signs may be obtained eitlier (i) by 
duplicating the simple lattice or (ii) by using a quadruple lattieej i.e., 
the first 4 replicates of the plans. With 36 or 100 treatments, only the 
first method can be used, since no quadruple lattice exists. The second 
procedure i>s slightly preferable, because the resulting desig’ii comes 
closer to symmetry, but the statistical analysis requires more time. 

10.24 Five Replicates. The first 5 replicates from the balanced set 
are taken. Designs are not included for 9, 36, 100, or 144 t^reatmeiits. 

10.26 Higher Numbers of Replicates. Balanced designs should be 
used for the following numbers of treatments and replicates: 25, 6; 49, 
8; 64, 9; and 81, 10. For 9 treatments in 8 or 12 replicates, 16 treatments 
in 10 replicates, and 25 treatments in 12 replicates, the plan for tlie bal- 
anced design should be repeat(;d. 

In other cases the following recommendations are made. 

Six replicates. Use the triple lattice tivice. 

Eight replicates. Use the simple lattice Jour tiines or the quadruple 
lattice tioice. 

Nine replicates. Use the triple lattice three times. 

Ten replicates. Use the simple lattice ye twies or the quintuple lattice 
twice. 

The designs recommended are not alwa 3 ^s as fully balanced as they 
might be made. For instance with 6 replicates of 16 treatments the bal- 
anced design (5 replicates) plus 1 extra replicate gives a more nearly 
B 5 mimetrical arrangement than the triple lattice used twice. Since 
separate computing instructions would be I'equired for* such plans, we 
have preferred to adhere to a more uniform s^^stem. Foi' this reason no 
account is given of designs with 7 replicates, though theii* statistical 
analysis presents no greu^t difficulty to the reader who has inasteu*ed the 
principles. 

10.26 Arrangement of Experimental Material, In the arrangement of 
a group of experimental material so as to apply one of those designs, the 
most important criterion is that units within the same incomplete block 
be homogeneous. In fi.eld trials, for example, if the plots are oblong the 
usual procedure is to have the incomplete blocks, as nearly square as 
practicable, the plots extending the whole length of one side of the block. 
Uniformity trial investigations have shown that this layout gives on the 
average the most homogeneous block. In cases where there is detailed 
knowledge of the experimental site, some other method of grouping may 
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be superior. Efforts should be made to keep the experimental technique 
uniform for all units in the same block. Changes in technique that are 
necessary should be made when changing from one block to another. 

When the blocks have been formed, there is a secondary advantage in 
forming replications so that blocks within the same replicate are as sim- 
ilar to one another as the material permits, since this increases the pre- 
cision of inter-block comparisons. If the full statistical ts carried 

out} it is more important to have the incomplete blocks homogeneous 
than to have the replications homogeneous. Thus, when the grouping 
of blocks into compact replications is troublesome, this criterion may 
be ignoi'ed without much loss of precision. 

On the other hand, it is desirable to have homogeneous replications if 
the data may subsequently be anal3^zed as a randomized block design. 
As we have pointed out, the randomized block analysis may be needed 
in fields of work where certain experimental units or whole treatments 
are likely to be destroyed during the course of the experiment (e.g., in 
pasture plots subject to winterkill), 

10.27 Ratidomization, The randomization consists of three steps, 

L Randomize the blocks separately and independently within each 
replication. 

2. Randomize the treatments separately and independently within 
each block. 

3. Allot the treatments to the treatment numbers at random. 

For methods of randomization, see chapter 15. 

Steps 1 and 2 give each treatment an equal chance of being allotted to 
any experimental unit. These steps correspond to the allotment of 
treatments to units at random in an ordinary randomized block design. 

The function of step 3 is to decide by random choice which groups of 
treatments will form the blocks of the design. If differences among blocks 
are large, the error variance per plot for the mean of a group of treat- 
ments which lie in the same block may be considerably higher than the 
average error variance. This additional randomization ensures that the 
average error variance may be used/ in nearly all cases, for comparisons 
among groups of treatments. For further discussion, see reference (10.4) , 

When a plan is repeated in order to obtain extra replications, a separate 
randomization must be made for every replicate. 

10.28 Statistical Analysis, An account of the theory, with worked 
examples of the simple and triple lattice, is given in reference (10,5) ; a 
systematic presentation of the shortest computational methods, with 
worked examples of the simple, quadruple, and balanced lattices in 
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(10.6); and a detailed description of the methods for carrying out the 
computations on LB.M. (Hollerith) pmiched-card machines in (10.7). 
The computational methods differ slightly from one reference to another, 
but all are basically the same. 

Table 10.3 shows the plan and yields for a 5 X 6 simple lattice ex- 
pei’iment on soybeansj the treatments being 25 varieties produced in the 


TABLE 10.3 Yields of 5 X o simple lattice bxpeeiment on soybeans 
(Yields are in bushels per minus 30 bu.) 






Rep. I 





B 

<7 

fxO 

0) 

6 

(2) 

7 

(3) 5 

(4) 

8 

(6) 

6 

32 

+ 

61 

+ 

9.5 

(6) 

16 

(7) 

12 

(8) 12 

(9) 

13 

(10) 

8 

61 


8 

- 

1.3 

(11) 

17 

(12) 

7 

(13) 7 

(14) 

9 

(15) 

14 

54 

+ 

48 

+ 

7.5 

(16) 

18 

(17) 

16 

(IS) 13 

(19) 

13 

(20) 

14 

74 


15 


2.3 

(21) 

14 

(22) 

15 

(23) 11 

(24) 

14 

(25) 

14 

68 

+ 

17 

+ 

2.7 










289 

+ 103 

+16.1 





R,ep. 11 










(1) 

24 

(6) 

13 

(11) 24 

(16) 

11 

(21) 

8 

80 


9 

- 

1.4 

(2) 

21 

(7) 

11 

(12) 14 

(17) 

11 

(22) 

23 

80 


23 


3.6 

(3) 

16 

(8) 

4 

(13) 12 

(IS) 

12 

(23) 

12 

56 


8 

- 

1.3 

(4) 

17 

(9) 

10 

(14) 30 

(19) 

9 

(24) 

23 

89 

- 

32 

- 

5.0 

(5) 

15 

(10) 

15 

(15) 22 

(20) 

16 

(25) 

19 

87 


31 

— 

4.8 










392 

- 

103 

- 

16.1 



Treatment totals (unadj.) 




mC 





(1) 

30 

(2) 

28 

(3) 21 

(4) 

25 

(5) 

21 

H-0.5 





(6) 

29 

(7) 

23 

(S) 16 

(9) 

23 

(10) 

23 

-1.3 





(11) 

41 

(12) 

21 

(13) 19 

(14) 

39 

(15) 

36 

+7.5 





(16) 

21) 

(17) 

27 

(18) 25 

(19) 

22 

(20) 

30 






(21) 

22 

(22) 

38 

(23) 23 

(24) 

37 

(25) 

33 

+2.7 





mC-i 

..4 

-3 

.6 

-1.3 


i.O 

-4 

i8 

0.0 








Treatment totals (adj.) 








(1) 

38.1 

(2) 

33.9 

(3) 29 

.2 

(4) 29.5 

(5) 25.7 





(6) 

26.3 

(V) 

18.1 

(8) 13 

.4 

(9) : 

16.7 

(10) 16.9 





(11) 

47.1 

(12) 

24.9 

(13) 25 

.2 

(14) 41.6 

(15) 38.7 





(16) 

25.3 

(17) 

21.1 

(18) 21 

.4 

(19) : 

t.4.7 

(20) 22.9 





(21) 

23.3 

(22) 

37.1 

(23) 24 

.4 

(24) 34.7 

(25) 30.9 






soybean breeding program of the North Carolina Experiment Station. 
The experiment actually contained 4 replications, obtained by duplicat- 
ing the plan for the simple lattice. The first 2 replications will Berve to 
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illustrate the analysis for simple, triple, and quadruple lattices, which all 
follow the same general pattern. In section 10.29 the data from the 
whole experiment will be analyzed in order to indicate the procedure 
when the basic plan is repeated. 

1. Find the block totals B, the replication totals, the treatment totals, 
and the grand total. 

2. For each block calculate 

C = total (over all replicates) of ail treatments in the block — rB 

For example, for block 1 in replication 1, 

C = 30 + 28 + 21 + 25 + 21 - (2) (32) = +61 

Find the replicate totals Re of the C values. These replicate totals 
should add to zero. 

3. The analysis of variance is as follows. 


Roplications 

d.f. 

(?• - 1) 

1 

s.s. 

212.18 

m.s. 

Treatments (miadj.) 

0^ - 1) 

24 

559.28 


Blocks within replications (adj.) 

r{k - 1) 

8 

501.84 

62.78/4 

Intra-block error { 

[k - l)()-7o ~k~l) 

16 

218.48 

rSMEe 

Total 

{rk‘^ - 1) 

49 

1491.78 



All sums of stpiares are found in the usual way except that for blocks 
(adjusted), which is given by 

Eg- _ E^/ ^ (6.1)^ + (8)" + ---+( 31)^ _ (103)^ + (1,03^ 

kr(r — 1) /c+(r — 1) 10 50 

- 501.84 

where Ra denotes a replication total of the 6”s. 

4. The weighting factor used to obtain the adjusted treatment loials is 

(62.73 -- 13.66) 

■„ SS/--— =£; 0.1564,, 

k(r - l)Ei (5) (62.73) 

where Eh and Ee are, respectively, the mean scpiares for l4ocks and intra- 
block error. If Eh is less than the factor is l-aken as zero and no ad- 
justments are made for block eff(K'.ts, the experiment being analyzed as if 
in randomized blocks. 

Each C value is now multiplied by m lo obtain the block corrections 
liC. These should sum to zero, apart from rounding errors. Each treat- 
ment total is adjusted by applying the appropriate correction for every 
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hlock in wliich tiie treatoKMit For oxarnplej the adjnstecl total 

for treatinc^ivt 4 is 

25 + 0.5 A.O - 29.5' 

In the ease of the smrpls lattice it pa3^s to write the juCJ vahies around the 
tiAle of iiruidjusted treatment totals, putting tlie values for replication 
1 in the right-hand eolnmn mid those for replication 2 in the bottom row. 
Then the adjustments to treatment total are at the end of tlie row 
and the bottom of tlie column in which the treatment lies. 

5, The error variance of the difference lietween two treatment means 
is sligIdJy smaller for treatments that; appea,r in the same blocdc than for 
those that do not. The formulae are: 


Two troat.iiKmts in tin,' sanio block: 

r 

Two iicatiiK^nts not in same 

r jCi 


, 2E,J^ , rk^ 1 (2)(13.66)r, , (2) (5) ( 0.1 564) 

Average: J = -y-- + ^-7- 


The corresponding standard errors are 3.97, 4.23, and 4.15, respec- 
tivefo. Except with small designs it is sufficient to use the average value, 
4.15, for all ^-tests between pairs of treatments. 

6. Tlie analysis of variance does not supply an F-test of the adjusted 
treatment totals. A test of the unadjusted treatment totals can be ol> 
tained l;)}^ analyzing the data as if tlie experiment were in randomized 
blocks. The error m.s. is the pooled mean scpiare for Idocks and intra- 
block error, as shown below. 

d.f, s.s. m.a 

Tr^^^ltments (unadj.) ik" - 1) 24 559.28 23.30 

Error - l)(r ^ 1) 21 720,32 30.01 


This test is not fulty sensitive, since it is based on unadjusted treatment 
totals, but it will be sufficient in cases where an F-test is of secondary in- 
terest or where differences among incomplete blocks are small. 

An F-test of the adjusted treatment totals requires a little more cal- 
culation. This is most easily done by changing the unadjusted treat- 
ments s.s. (which is already available) so that it may be tested against 
the intra-block error m.s. Calculate Bu, the unadjiiBted sum of squares 
for blocks within replications, which comes to 350.00, If Ba the ad- 
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justed sum of squares for blocks within replications (501.84), we sub- 
tract the following quantity from the treatments s.s. 


k{T — l)/x 


[[ r 

I .(7- — 1)(1 + kix). 


= (5)(1)(0.1564) 


1 + (5)(0.1564)J 


(360.00) - (501.84) [ = -85.30 


In this case the subtraction term is negative, which is rather unusual. 
We add 85.30 to the unadjusted treatments s.s. The /^^-test is then com- 
pleted by the following analysis. 



d.f. 

s.s. 

m.s. 

Treatments 

24 

644.58 

26.86 

Intra-block error 

16 

218.48 

13.66 


The F-ratio, 1.97, lies between the 10 and the 5% levels. The F value 
differs considerably from that obtained by the randomized blocks test, 
as may happen when the adjustment for incomplete block effects results 
in a substantial increase in precision. 

7. The error m.s, in the randomized blocks analysis was found to be 
30.01. To estimate the gain in accuracy over randomized blocks, we 
compare this figure with the effective error variance 


1 + 




(fe + 1). 


(13.66) 


1 + 


(2)(5)(0.1564) 


17.22 


The relative accuracy is 30.01/17.22, or 174%. 


10.29 Statistical Analysis for Repetitions of the Designs, Suppose 
that the basic design contains n replicates (where, e.g., n = 2 for a simple 
lattice) and that this is repeated p times, so that the total number of 
replications is r — np. For illustration, we present a joint analysis of 
the data in tables 10.3 and 10.4 (pp. 275 and 289), which show 4 repli- 
cations of the 5X5 simple lattice. Replication III is a repetition of 
replication I, and IV of 11. The field results have been rearranged so 
that treatments follow the same order within corresponding replications. 
In terms of our notation, 7i = 2^ p — 2, and r = 4, 

1. Calculate the block totals, the replication totals, the treatment 
totals (shown in table 10.4), and the grand total. 

2. Arrange the block totals in supplementary tables as on the right of 
table 10.4. In these tables, blocks that contain the same set of k treat- 
ments are placed in the same row. Each table has k rows and p columns, 
while the number of tables is n. Form the row and column totals of each 
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ta,ble. For each group of similar blocks, compute the quantities 

(7 == total (over all replicates) of the treatments appearing in the 
group — 7i(total of the group of blocks) 

Thus for the first block in replications II and IV, which contains treat- 
ments (1), (6), (II), (16), and (21), 

C - 59 + 51 + SO + 

Find the replication totals Rc of the C values; these should ad<l to zero. 
3, The analysis of variance i>s as foIlo\¥s. 


Replications 

d.f. 

(r-l) 

3 

s.s. 

226.19 

m.s. 

Treatments (unad j .) 

(/c«-l) 

24 

791.24 


Blocks within replications (adj.) 

f(A'-l) 

16 

786.00 

49J2A’/> 

Component (a) 

nip - iKfc - 1) 

S 

164.72 


Component (fe) 

nik — 1) 

8 

621.28 


Intra-block error 

[h - - X) 

56 

761.56 

is.ooi;. 

Total 

{rk^ - 1) 

99 

2664.99 



The blocks s.s. contains two components, both of which are obtained 
from the supplementary table of block totals. 

Cimiponent (a) is a new component, which arises only when the design 
has been repeated. It is composed of the differences between the totals 
of ]:Iocks that contain the same set of treatments. Since each supple- 
mentary table has k rows and p columns, it may be analyzed as follows. 


Rows 

Columns 

Rows X (columns 


d.f. 


{k - 1 ) 

(p - 1) 

(A: -- 1)(^ -- 1) 


Total 


(/c/i - 1) 


The sum of squares for component (a) is the sum of the ro\vs X columns 
interactions over all n tables. It may be obtained from the following 
calculations. 


Total: 

Rows: 


OoluinnH: 


( 32 )^' + ( 91)2 - 1 - . . . + ( 75)2 ^ ( g 4^)2 ( 550)2 ^ ( 749)2 

(104)2 + (142)2 4. ... 4. (171)2 (050)2 ^ (749)2 

(289)2 + (361)2 4. (392)‘-2 4. (357)2 (550)2 (749): 


- 25 


50 


-- 602,18 

- 309.28 

=- 12 ai<S 


Component {a) s.s. — 602.18 — 309.28 — 128.18 =« 164.72 


288 


LATTICE AND CUBIC LATTICE DESIGNS 


10.29 


The above is a general method that covers all cases. With a simple 
lattice, a quicker alternative is to calculate the differences between the 
totals of similar blocks. The sum of square>s of deviations of these dif- 
ferences from their replication means, divided by 2 /i;, gives the sum of 
squares for component (a). 

Com'ponent (ft) is the component that is present even when there are 
no repetitions. 

Component (b) s.s. = 7— — 77 7 ^— 77 

kr{n - 1) - 1) 

( 63 )^ + ( 29 )^+***+ ( 48)2 ( 99)2 4 .( 99)2 


- 621.28 

The sum of squares for blocks within replications (adjusted) is the pooled 
sum of squares for the two components. All other terms in the analysis 
of variance are computed in the usual way. 

4. If El, is the pooled mean square for blocks, and Ec that for erroi^ 
the weighting factor is 

p(Ei - Ee) (2) (36.52) 

.^ 7 :^ == 0.1270 

k[{r ^ p)E, + {p-^ l)E,] (5)[(2)(49.12) + 13.G0] 

where it will be recalled that p is the number of repetitions of the basic 
design, in this case 2 . As usual, no adjustments are made if Eb is less 
tlim Ee* 

The block corrections /zC are obtained and entered in table 10.4; they 
should add to zevo except for rounding errors. Each treatment total is 
adjusted by applying a correction for each group of blocks in which the 
treatment appears. For instance, treatment (13) appears in the third 
block in replications I and III and in the third block in replications II 
and IV. Its adjusted total is 


44 + 8.8 -- 0,1 = 52.7 

5. The error variances for the difference between two treatment means 


Two treatments in the same block : 

2Ee 


■ [1 4- (a 


(2) (13.60) 


[1 + 0.1270] « 7.70 


Two treatments not in same block: 


— [1 + nfM] [1 + (2)(0.1270)] *= 8,06 

r 4 


Average: 


J ^ j* 


ik + 1 ) 


(2)(5)(0.1270)' 
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TABLE 10.4 A 5 X 5 simple lattice (hi-ips. Ill and IV) 

I Block totals 






Rep. 

III 






1 

III 

Sum 

■ € 

J^C 

(1) 

13 

(2) 

2() 

(3) 

9 

(4) 

13 

(5) 

11 

72 

32 

72 

104 

+63 

+ 8.0 

(6) 

15 

(7) 

18 

(S) 

22 

(9) 

11 

(10) 

15 

81 

61 

81 

142 

-21) 

- 3.7 

(11) 

19 

(12) 

10 

(13) 

10 

(14) 

10 

(15) 

16 

65 

64 

65 

119 

+69 

~t“ 8.8 

(16) 

21 

(17) 

16 

(18) 

17 

(19) 

4 

(20) 

17 

75 

74 

' 75 

149 

-34 

- 4.3 

(21) 

15 

(22) 

12 

(23) 

13 

(24) 

20 

(25) 

8 

m 

68 

68 

136 

+30 

+ 3.8 











361 

289 

361 

650 

+99 

+ 12.6 





Rep. 

IV 




1 

.i 


II 

IV 

Bum 

C ■ 


(1) 

16 

(6) 

7 

(11) 

20 

(16) 

13 

(21) 

21 

77 

80 

77 

157 

- 3 

- 0.4 

(2) 

15 

(7) 

10 

(12) 

11 

(17) 

7 

(22) 

14 

57 

80 

57 

137 

+ 2 

+ 0.3 

(3) 

7 

(8) 

n 

(13) 

15 

(IS) 

15 

(23) 

16 

64 

56 

64 

120 

- 1 

- 0.1 

(4) 

19 

(9) 

14 

(14) 

20 

(19) 

6 

(24) 

16 

75 

89 

75 

164 

-49 

- 6.2 

(5) 

17 

(10) 

18 

(15) 

20 

(20) 

15 

(25) 

14 

84 

87 

84 

171 

-48 

- 6.1 











357 

392 

357 

749 

-99 

-12.5 


Treatment totals (4 reps.) 


(1) 

59 

(2) 

69 

(3) 

37 

(4) 

57 

(5) 

49 

+8.0 

(6) 

51 

(7)' 

51 

(8) 

49 

(9) 

48 

(10) 

56 

'-3.7 

(11) 

80 

(12) 

42 

(13) 

.44' .. 

(14) 

69 

(15) 

. 72 ■ ■: 

+8.8 

(16) 

63 

(17) 

50 

(18) 

57 

(19) 

32 

(20) 

62 


(21) 

58 

(22) 

64 

(23) 

52 

(24) 

73 

(26) 

55 

+3,8 

-0.4 

+0.3 

-0.1 

-6.2 

-6.1 






AdjuHted treatment totals 




(1) 

66.6 

(2) 

77.3 

(3) 

44.9 

(4) 

58.8 

(5) 

50.9 


(6) 

46.9 

(7) 

47.6 

(8) 

45.2 

(9) 

38.1 

(10) 

46.2 


(11) 

88.4 

(12) 

51,1 

(13) 

52.7 

(14) 

7L6 

(15) 

74.7 


(16) 

58.3 

(17) 

4(l0 

(18) 

52.6 

(19) 

2L5 

(20) 

51.6 


(21) 

61.4 

(22) 

68.1 

(23) 

55,7 

(24) 

70.6 

(25) 

52.7 



6. An approximate F-test of the adjusted treatment totals is obtained 
by the method described in the previous section, step 6. In this case the 
quantity that must be subtracted from the treatments s,s. is 

k{n — l)p — iwTTlTTT 

Lin - l)(l + kfx) J 

where Bu is the unadjusted and Ba the adjusted sum of squares for com- 
ponent (6) of the blocks. Both quantities have already been computed: 
Baj 621.28, appears in the analysis of variance, while 309.28, is the 
rows s.s. used in computing component (a). 
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7. A comparison Avith randomized blocks is made as in the previous 
section, step 7. The effective error variance of the lattice is taken as 


ji/ca 


10.210 Missing Data. If the missing observations are numerous, an 
analysis by the method for randomized blocks is recommended. If it is 
desired to carry out the full analysis, missing values are estimated by the 
formula which minimizes the intra-block error s.s., though as explained 
in section 10.13 this method is only approximate. 

Experiments with no repetition of the basic design. 

(r - l)h?T - rR + 0 - rhC + TcC 

(r - l)(k - l)(riS: - /c - 1) \ ; 

In this formula r and R are the totals for the treatment and i-eplicate 
that contain the missing value, G is the grand total, and C is the C value 
(defined in section 10.28) for the block which contains the missing obser- 
vation. The only unfamiliar quantity is C" : 

" .'9 

C' = total of the C/ values for all blocks which contain the treat- 
ment that has the missing value 

Example, For the experiment analyzed in section .10.28, suppose that 
3 observations are missing: treatments (1) and (12) in replication I and 

TABLE 10.5 Sotmary totals for a 5 X 5 siiviplb i.atticu with three 

. MISSING. VALUES ' . 


Treatment totals 


41 

29 

22 


28 

(3) 

21 

(4) 

25 

(5) 

21 

23 

(8) 

16 

(0) 

23 

(10) 

23 

14 -h y 

(13) 

19 

(14) 

9 

(15) 

36 

27 

(18) 

25 

(19) 

22 

(20) 

30 

38 

(23) 

23 

(24) 

37 

(25) 

33 





Kep. II 






B 


C 



67 


^ z 


--15 +a: 
-30 + y 
-8 

~2 — z 
--31 
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treatment (14) in replication IL Denote the estimates of the 3 missing 
values by ;r, and respectively; these will be obtained by the method 
of succesvsive approximation. 

It is best to start by finding ail block, replicate, and treatment totals 
and all C values, as in steps 1 and 2 of the computing instruetions for 
this design. Wherever a missing observation is involved, include the ap- 
propriate ;r, or z in its place. The summary totals are shown in table 
10.5. Note that the total of the 0 values over the whole experiment is 
identically zero, so that a good check is available on this part of the cal- 
culations. 

We must now find first approximations for ?/ and z. In the replications 
in which they are present, their values are 14 and 9, respectively. How- 
ever, replication II appears to have higher values than replication I, and 
it is probably worth while to make an adjustment for replication effect. 
If we ignore the missing values, the means per plot are about 12 for 
replication I and 15 for replication II. Since y is missing in replication 
I, we subtract 3 from its value in replication II, giving 11. For ^ we add 
3, giving 12 as the first approximation. Using y = 11, z - 12, we now 
solve for from formula (10,2). For r = 2, /c = 5 the formula becomes 

25T - 2R + G - IOC + 5(7' 

X =' : — ' — : 

■ , 16 ■■ 

Put y == 11 j z 1'^. From table 10.5 we find for x 

T - 24; R - 276 + 11 - 287; C = 638 + 11 + 12 - 661 
C ^ 67; C' - 67 - 15 = 52 


X = 


(25)(24) - (2)(287) + 661 (10)(67) + (5)(52) 


277 

17 ' 

IG 


Put .r — 17\ z = 12. For y we have 

T - 14; R = 276 + 17 + 293; G - 038 + 17 + 12 - 667 
C - 25 + 12 = 37; C' - 25 + 12 - 30 - 7 
(25)(14) - (2)(293) + 667 - (10)(37) + (5)(7) „ ^ 

; ~ ------- , . ^ - . ■ 

Put X = 17, y = 8. For z we have 

3' = 9; = 362; G = 638 + 17 + 6 = 661; C' == -2 

C" = -2 + 25 - 6 = 17 

(25)(9) - (2)(362) + 661 - (10)(-2) + (.5)(l7) _ 2C7 


17 


S' = 


16 
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7. A comparison with randomized blocks is made as in the previous 
section, step 7. The effective error variance of the lattice is taken as 




1 + 


nkfi 

w+^ 


10.210 Missing Data. If the missing observations are numerous, an 
analysis by the method for randomized blocks is recommended. If it is 
desired to carry out the full analysis, missing values are estimated by the 
formula which minimizes the intra-block error s.s., though as explained 
in section 10.13 this method is only approximate. 

Experiments with no repetition of the basic design. 


(r - - rR + Q - rhC ■{- hC’ 

T 


^ {r - l){k ~ l){rh ~ k - l) 

In this formula T and R are the totals for the treatment and replicate 
that contain the missing value, G is the grand total, and C is the C value 
(defined in section 10.28) for the block which contains the missing obser- 
vation. The only unfamiliar quantity is C': 

C' = total of the C values for all blocks which contain the treat- 
ment that has the missing value 

Example. For the experiment analyzed in section 10.28, suppose that 
3 observations are missing: treatments (1) and (12) in replication I and 

TABLE 10.5 SuMMAEY totals for a 5 X 5 simple latoicb with three 

MISSING VALUES 

Treatment totals 


(1) 24: 

(2) 

28 

(3) 

21 

(4) 

25 

(5) 

21 

(6) 29 

(7) 

23 

(8) 

16 

(9) 

23 

(10) 

23 

(11) 41 

(12) 

14 4 y 

(13) 

19 

(14) 

9+z 

(15) 

36 

(16) 29 

(17) 

27 ■ 

(18) 

25 

(19) 

22 

(20) 

30 

(21) 22 

(22) 

38 

(23) 

23 

(24) 

37 

(25) 

33 

Rep. I 





Rep. II 



B 


C 



B 


C 


26 -fa: 


67 - 



80 


-15 4 ^ 


61 


-8 



80 


-30 4 y 


47 + 2/ 


25 - ?/ H- 

■2: \ 


56 


-8 


74 


-15 



59 4 3 


-2 -z 


68 


17 



87 


-31 


h ^ 276 + a; 4- ^ 

86 

- X - y 

Ei 

= 362 4- 3 -86 a: + V 



(t =! i2i + ^2 638 + i*? + 2/ + 2 
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treatment (14) in replication II. Denote the estimates of the 3 missing 
values by x, y, and z, respectively; these will be obtained by the method 
of successive approximation. 

It is best to start by finding all block, replicate, and treatment totals 
and all C values, as in steps 1 and 2 of the computing instructions for 
this design. Wherever a missing observation is involved, include the ap- 
propriate Xj y, or z in its place. The summary totals are shown in table 
10.5. Note that the total of the C values over the whole experiment is 
identically zero, so that a good check is available on this part of the cal- 
culations. 

We must now find first approximations for y md z. In the replications 
in which they are present, their values are 14 and 9, respectively. How- 
ever, replication II appears to have higher values than replication I, and 
it is probabb^ worth while to make an adjustment for replication effect. 
If we ignore the missing values, the means per plot are about 12 for 
replication I and 15 for replication II. Since y is missing in replication 
I, we subtract 3 from its value in replication II, giving 11. Porz we add 
3, giving 12 as the first appimimation. Using y = 11, z = 12, we now 
solve for x from formula (10.2). For r = 2, /c = 5 the formula becomes 

25T - 2R + G - IOC + 5C' 

X == — ^ 

. lO'-' 

Put y — ip z = 12. From table 10.5 we find for x 


T = 24; E = 276 + 11 = 287; C - 638 + 11 + 12 - 661 
. e = '67; ' C'v= 67 -15' = 

(25) (24) ~ (2) (287) + 661 - (10) (67) + (5) (52) _ 277 __ ^ 
. ^ ____________ - 

Put X = 17, z = 12. For y we have 

r = 14; E = 276 + 17 + 293; G = 638 + 17 + 12 = ()67 
(7 = 25 + 12 = 37; C' = 25 + 12 - 30 = 7 
(25)(14) - (2)(293) + 667 - (10)(37) -f (5)(7) 96 


Put X — 17, y = 6. For z we have 

T = 9; S = 362; G = 638 + 17 + 6 = 661; C == -2 
C = -2 + 25 - 6 = 17 

(25)(9) - (2)(362) + 661 - (10)(-2) + (5)(17) ^ ^ ^ 


z 
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This completes the first round. A second round leads to x ^ 18, y = 
z == 17, and there is obviously no need for further calculation. The 
rest of the analysis proceeds as usual except that 3 d.f. are omitted from 
the total s.s. and the intra-block error s.s. For approximate ^-tests, see 
section 10.13* 

Experime^iU with repetitions of the basic design. If the basic design has 
replicates and these are repeated p times to give r = np replications, 
the formula becomes 


X = 


{n - l)h^T + {n - l)ri2 + G- nkC + kC' - n^R' 


(10.3) 


All symbols have the same meaning as in (10.2) except that C and C' are 
now derived from the totals of groups of similar blocks, just as in the 
statistical analysis for this case in section 10.29. The new quantity IV 
is defined as 

72' = total of all replications that are similar to the replication con- 
taining the missing value 

Note that this total includes the replication with the missing value. 

Example, In the 5X5 simple lattice with 4 replications, wo have ??- == 
2, r = 4. If treatment (1) is missing in the first replication, the reader 
may verify that 

r==53; 72 = 283; G = 1393; C = 69; C' = 69 - 9 = 60 
72' = 283 +361 = 644 

This gives 

(25)(53) + (4)(283) + 1393 ~ (10)(69) + (5)(60) - (4)(644) 


X = 


56 


844 


16 


10,3 Rectangular Lattices 

10.31 Description. These designs were developed recently by Harsli- 
barger (10.8) for k(k + 1) treatments in blocks of k units. They form a 
useful addition to the square lattices described in previous sections, since 
the allowable numbers of treatments, 12, 20, 30, 42, 56, 72, etc., fall about 
midway between the allowable numbers for square lattices. The sta- 
tistical analysis is quite similar to that for simple and triple lattices, 
though it takes more time because the block adjustments are not so 
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simple as with square lattices. The new designs are less symmetrical 
than the square lattices, in the sense that there is a greater variation in 
the accuracy with which two treatment means are compared. It will 
be recalled that only two standard errors are required for Hosts with the 
simple and triple lattice — one for two treatments that appear in the same 
block and one for two treatments that do not. The simple rectangular 
lattice requires four standard errors, while the triple rectangular lattice 
requires seven. For practical purposes it appears that these can nearly 
always be reduced to two. 

There are several ways of constructing the designs. One (suggested 
by G. S. Watson) is by means of a latiii square with (/j + 1) rows and 
columns, in wlxich every letter in the leading diagonal is different. When 
writing down the square we attach a number to all letters except those 
in the leading diagonal, as illustrated below for a 4 X 4 square. 


A 

B1 

C2 

J03 

D4 

C 

Bb 

AO 

B7 

A8 

D 

C9 

CIO 

Ull 

A12 

B 


In the first replication we place in a block all numbers that lie in the 
same row of the latin square, in the second replication all numbers that 
lie in the same column, and in the third all numbers that have the same 
latin letter. 


Block Rep. I Rep. II Rep. Ill 


1 

T" 

'2 . 

3 

■■.■.■'1 

1 

8 

11 

1 

1 5 7 

2 

4 

5 

6 

2 

'■ "2- 

5 

12 

2 

2 9 10 

3 

■ 7 

8 

9 

3 

3 

6 

9 

3 

3 4 n 

4 

10 

11 

12 

4 


7 

id 

4 

0 S 12 


The important feature of this arrangement is that no two treatments 
are in the same block more than once. The use of a latin square with 
different letters dowm the leading diagonal ensures that in the third 
replication the three numbers associated with any letter are all in dif- 
ferent rows and columns, and hence have not previously occurred to- 
gether in a block. 

Plans 10.10-10.16 were constructed by this method. By using the 
first 2 replications from any plan we obtain a rectangular lattice in 2 
replications, which b^^ means of repetitions can be used for an experi- 
ment in 4, 6, 8, etc., replications. By using all 3 replications of the plan 
we have designs for 3, 6, 9, etc., replications. 
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Alternatively (as pointed out by S. S. Shrikhande), we may construct 
the designs from a balanced lattice with (/c + 1)^ treatments. If the 
first replication is omitted^ and if all treatments that apj^ear in an^^' one 
selected block in the first replicate are omitted in subsequent replica- 
tions, it is easy to verify that we generate a design for k{k + 1) treat- 
ments in which every block is of size k and in which no two treatments 
appear together more than once in the same block. This method gives 
any number of replicates up to k, though of com’se the method fails when 
no balanced lattice exists, as mtli 36 treatments. The discussion in this 
section is limited to simple and triple rectangular lattices. 

The method of randomizing is the same as for lattices (section 10.27). 

10.32 Statistical Analysis. In each plan, the three basic replications 
are described as the X, F, and Z replications. Further, if we take any 
block in one replication and examine another replication, we find that 
there is one and only one block in the other replication that has no treat- 
ments ill common with the chosen block. These two blocks ivill be called 
partners. Since it is important to be able to distinguish partners in the 
statistical analysis, each block is denoted in the plan by two symbols, 
e.g., F2, one to mark the replication and one to indicate the partners. 
All partners carry the same number; thus the partners of F2 are X2 and 
Z2. It is advisable to write the block labels in the notebook in which 
the original results are recorded. 

Example. The example is a triple rectangular lattice for 12 treatments 
in blocks of 3 {k = 3). Ai’tificial data were assembled by taking true 
treatment effects as shown in table 10.6, and adding to them true block 
effects as given vdth each block in table 10.7. Thus the observation 7 

TABLE 10.6 Theatment effects m artificial data for a 3 X 4 

TRIPLE RECTANGULAR LATTICE 


Treatment 

True effect 

True treatment 
total 

1 

14 

66 

2 

7 

35 

3 

■ 2 ■' 

20 

4 

0 

14 

5 

3 

23 

6 

11 

47 

7 

9 

41 

8 

16 

62 

9 

8 

38 

10 

1 

17 

11 

5 

29 

12 

6 

32 
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for treatment (10) in block X4 of table 10.7 is obtained by addm^ the 
true treatment effect, 1, to the true block effect, 6. Since the average of 
the true block effects comes out to be 50/12, or 14/3, the true treatment 
totals, over the 3 replicates, will be (3 t +• 14), where r is the true treat- 

TABLE 10.7 Plan anb obsbrvations for a 3 X 4 bectangular latocb 


(Treatment numbers are enclosed in parentheses) 


Block 

True blocl 

V 






Tot 

als 

Ad j list riient 

symbol 

effect 



Eep. 

X 



B 

Ox 

factor 

A4 

6 

(10) 

7 

(12) 

12 

(11) 

11 

30 

' -12 

-0.6 

XI 

2 

(2) 

9 

(3) 

4 

(1) 

16 

29 

20 

4-3.4 

X"3 

7 

(7) 

16 

(0) 

15 

(8) 

23 

54 

- 5 ■ 

-1.6 

X2 

0 

(4) 

0 

(6) 

3 

(6) 

11 

14 

30 

4“5.4 









127 

33 






Rep. 

F 



B 

Or 


F4 

9 

(3) 

11 

(6) 

20 

(0) 

17 

48 

-32 

-4.6 

F2 

3 

(1) 

17 

(11) 

8 

(8) 

19 

44 

10 

4-1.4 

F3 

3 

(12) 

9 

(2) 

10 

(5) 

6 

25 

10 

+ 1.4 

FI 

5 

(10) 

6 

(4) 

5 

(7) 

14 

25 

0 

-0.6 


142 -12 





Rep. Z 


B 

Cz 


Z1 

4 

(8) 20 

(6) 15 

(12) 10 

45 

4 

+0.2 

Z2 

8 

(9) 16 

(10) 9 

(2) 15 

40 

-16 

-3.8 

ZS 

1 

(11) 6 

(3) 3 

(4) 1 

10 

19 

+3.2 

Z4 

8 

(5) 11 

(1) 22 

(7) 17 

60 

-28 

-3,8 






145 

-21 



Treatment totals 

1 2 3 4 5 6 7 8 9 10 11 12 

Unadj. 55 34 18 6 20 46 47 62 48 22 25 31 

Adj. 56 35 20 14 23 47 41 62 38 17 29 32 

ment effect. These figures are shown in the column at the right in table 

10 . 6 . 

No intra-block error has been introduced into the data, which there- 
fore provide two checks on the method of analysis. First, in the analysis 
of variance the intra-block error s.s. should be found to be identically 
zero. Second, the adjusted treatment totals as found from the analysis 
should be exactly equal to the true treatment totals. 

The computing instructions given below apply to either the simple or 
the triple rectangular lattice without repetitions. The changes required 
when there are repetitions are given later. 
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1 . Find the block totals, B, the replication totals, the treatment totals, 
and the grand total. 

2. For each block calculate 

a = total (over all replicates) of all treatments in the block — rB 
Thus for block T3, 

C - 18 + 46 + 48+" (3) (48) -= -32 
As usual, the total of the is 

3. Arrange the (7 values in a supplementary table so that partners ap- 
pear in the same row. The row totals of this table, S, give the sums of 
the C values for each set of partners. The column totals give values 
denoted by Ra. 

TABLE 10.8 Supplementary TABLE OF 0 vAL'iTEB 


Block 


symbol 

■ CX: 

Or 

Cz 

S 

XOx 

xc:'r 

XOx 


1 

20 

0 

4 

24 

4.0 

0.0 

O.S 

0.6 

2 

30 

10 

-.16 

24 

6.0 

2.0 

-3.2 

0.6 

3 

'"-.'5. ; 

10 

19 

;^:24, . 

-1.0 

2.0 

3.8 

0.6 

4 

-12 

--•32 

-28 

-72 

-2.4 

-6.4 

-5.6 

-1.8 

Totals (Rc) 

33 

-12 

-21 

0 

6.6 

-2.4 

-4.2 

0.0 


4. The analysis of variance is now obtained. All sums of squares are 
found in the usual way, except that for blocks, adjusted for treatments, 
which is 

. . E g" . : . : ' : ; 

r(rk ~ k~ 1) r(k -f l)(rfc — k — 1) )-(r — l)(k + l)(rk ~ k ~ 1) 

_ {20)H(30)2+ . . . + (28)2 (33)2+(12)2+(21)2 (2#+ • • • +(72)2 

V + 7 , 16 (E:; :: Vgo ; : 

= 274.0 - 27.9 - 57.6 = 188.5 (10.4) 


TABLE 10.9 Analysis op variance 


Source of variation 

d.f. 


s.s. 

m.s. 

Replications 

(r-1) 

2 

15.5 


Treatments 

(k^+k- 1) 

11 

1067.0 


Blocks 

rk 

9 

188.5 

20.9^& 

Intra-block error 

(r - l)(k^ - 1) - & 

13 

0.0 

0.0^, 

Total . 

(fF + rfc - 1) 

35 

1271.0 



As anticipated, the intra-block error s.s, in table 10.9 is zero. 
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5. We now calculate the weighting factors used to obtain the adjusted 
treatment totals. Although a general formula can be given, it is simpler 
to present these separately for simple and triple designs. Two weighting 
factors are necessary.* 

Simple rectangular lattice 

X 


r{Eiy - E,) 


r(k 


M = 


1)^^, + {rk - 2k + r)Ee 
Xr(Eiy — Ee) 

rik + l)ir+ {7k - ^ 

Triple rectangular lattice 


r{E, - E,) 


r(2k - 1)A?6 + (rA - 6k + r)Ee 
}ir(Eh — Ec) 


2r{k + 1)^6 + {rk — 6A; — : 
In this example Ee is zero and we have 

3 . . (0.2) (3) 


X = 


(3)(5) 


- 0 . 2 ; 


P = 


(2)(3)(4) 


lr)Ee 


- 0.026 


(10.5) 

( 10 . 6 ) 

(10.7) 

( 10 . 8 ) 


6. Complete table 10.8 by adding the coliunns Wx, XCV, and 
(xS, The adjustment for any block is 

XC - pS 

where S is taken from the row in which C lies in table 10.8. Thus for 
block XI the adjustment is 

+4.0 - 0.6 - +3.4 

These adjustments are recorded on the plan in table 10.7. Note that 
the order in which blocks appear is different in tables 10.7 and 10.8; care 
must be taken to see that each adjustment is given to the appropriate 
■block., : ■ 

7. Finally, each treatment total is adjusted by adding the adjustments 
for every block in which the treatment appears. For instance, the 
adjusted total for treatment (4) is 

6 + 5.4 — 0.6 + 3.2 ~ 14 

* This method of estimating the weights is slightly different from that given by 
Hanshbarger. In the interests of simplicity we have used onlj’' the pooled moan 
sejuare for blocks and the. intra-block error m.s. foj’ estimating X and 
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Every adjusted treatment total will be found to be equal to the true 
treatment total as given in table 10.6. 

Statistical analysis for repeMtions of the designs. When the basic design 
with n replications is repeated p times to give r = np replications, the 
changes required in the steps of the analysis are noted belovr. 

1. No change. 

2a, For any block there are (p — 1) other blocks that have exactly 
the same set of treatments. Arrange the totals for such similar blocks 
in two-way tables, as exemplified below. There are n such tables. 

X Blocks 



' Repetition 

Tofcai 

Block no. 

1 2 • • ' p 

(S,3) 

1 

2 



(fc-bl) 



Total 




26. For each group of similar blocks calculate 
C — total (over all replicates) of all treatments in the group n( 2]) jS) 
3. No change. 

4a. The blocks s.s, now’ has two components. For component (a) ob- 
tain the interaction s.s. for each of the tables in step 2a, and add these 
sums of squares. Since each block total contains k observations, a 
divisor k is required for the analysis of variance. 

46. The sum of squares for component 6 is 



^ 1 

W 

■ i 

' j 



r{nk — /c — 1) 

r(k + l)(n& — fc — 1) r(/i — 

l)(k + l)(nk - k 

- 1) 

The separation 

of degrees of freedom in the anabasis of variance is shown 

below. 

d.f. 

m.s. 



Replications (/ — 1) 

Treatments -f — 1) 

Blocks rk 

Eb 



Component (a) (r — 

Component (b) nk 




Intra-biock error (r — — 1) ~ 

k Ee 



Total -i- rk — 1) 
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5. The weighting factors in formulae (10.5)-(10.8) are unchanged. 
Note that S?) is the pooled mean square for blocks. 

6. Unchanged. 

7. Unchanged. 

As mentioned previously, in oi'der to have i-tests for every pair of 
treatments we would require to present 4 standard error formulae for 
simple rectangular lattices and 7 for triple rectangular lattices. With 
very little inaccuracy these can be reduced to 2, one for the case where 
the two treatments appear in a block and one for the case where this is 
not so. For the difference between two adjusted means the estimated 
error variances are: 


Simple recta/ngular lattice 


2Ee 


Two treatments in the same block: — ^ (1 -f A — lA 

r 

'2ilS 

Two treatments not in same block: (1 -}- 2A — 3,-. 

r 




2(/c -• 1) 
"b /c 1 


(1 -f X — ;i) + 


F - /c + 1 
4- k 

r 


^(1 +2X -3;.)] 


Triple rectangular lattice 


2/A - {k^ + Ic - l)ju1 
{k^ b /c - 1) 


2E. 


Two treatments in the same block: — ^'(1 H- 2X ~ ji) 

' r ■ 

2Ee ( 3 A 

Two treatments not in same block: 1 1 + 3X — - 


Average: 




_/c^ + /c - U" ' ' U^A-k-\ 

In the example, where X = 0.2, g. = 0.025, the factors multiplying the 
usual term 2EJr are 1.375 for two treatments in the same block and 
1.562 for two treatments not in the same block. The factor in the aver- 
age variance is 1.460. 


10.4 Cubic Lattices 


10.41 Description, These designs were produced by Yates for plant- 
breeding wmrk in which selections are to be made from an umisiialiy large 
number of varieties. The number of treatments must be an exact cube. 


* Owing to the fact that only two standard errors were used, this average is not 
quite equal to the average variance taken over all possible pairs, though it is very 
close to that value. 
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The most useful range comprizes 27, 64, 125, 210, 343, 612, 729, and 
1000 treatments. The size of block is the cube root of the number of 
treatments, i.e., 3, 4, 5, 6, 7, 8, 9, and 10, respectively. Thus cubic lat- 
tices can accommodate a large number of treatments in a small size of in- 
complete block. The designs have been used, for example, in an ex- 
periment with 729 strains of ponderosa pine seedlings (10,9) and an ex- 
periment with 729 soybean varieties (10. 10) . The number of replicates 
must be 3 or some multiple of 3. 

Since the^plans are easy to construct and since they occupy a con- 
siderable amount of space for the higher mniibers of treatments, they are 
not reproduced here. To obtain a plan, the treatments are numbered 
by means of a three-digit code in which each digit takes all values from 
1 toi;,. For 27 treatments, the code is as follows: 


T 

Code 

T 

Code 

T 

Code 

1 

111 

4 

121 

7 

131 

2 

211 

5 

221 

8 

231 

3 

311 

6 

321 

9 

331 

10 

112 

13 

122 

16 

. 102 

11 

212 

14 

222 

17 

232 

12 

312 

15 

322 

18 

332 

19 

113 

22 

123 

25 

133 

20 

213 

23 

223 

26 

233 

21 

313 

24 

323 

27 , 

333 


* Treatment number. 

The same principle applies with a larger number of treatments. For 
the first h treatments, the last two digits are fixed at (11) while the first 
digit runs from 1 to k The next k treatments are coded by fixing the 
last two digits at (21) while the first digit again runs from 1 to fc, and so 
on in a systematic manner until the final k treatments are reached, for 
which the last two digits have the fixed values (kk). 

Within each of the 3 replications, the k^ treatments are grouped into 
blocks, each of size k. In the first replication, the rule for this group- 
ing is to keep the last two digits constant within a block, allowing the 
first digit to take all values from 1 to k Thus, in the example above, the 
9 groups of treatments constitute the 9 blocks, block 1 containing the 
treatments (111), (211), and (311). 

To form blocks in the second replication, we keep the first and last 
digits fixed within any block and give the second digit all values from 
1 to k With 27 treatments, the first block therefore contains (111), 
(121), and (131), the second block (211), (221), and (231), and the last 
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block (313), (323), and (333). In the third replication, the first and 
second digits are constant within each block. 

The composition of the blocks in the second and third replications is 
shown in table 10.10. 

TABLE 10.10 Sbconid and third bepdicatks oir a cxtbic lattice with 




27 TREATMENTS 







Rep. II 





(1) 


(2) 

(3) 


(4) 

(5) 


Code T * 

Code T 

Code 

T 

Code r 

Code 

T 

111 

1 

211 2 

311 

3 

112 10 

212 

11 

121 

4 

221 5 

321 

. b 

122 13 

222 

14 

131 

7 

231 8 

331 

9 

132 16 

232 

17 

(6) 


(7) 

(8) 


(9) 



Code 

T 

Code T 

Code 

T 

Code T 



312 

12 

113 19 

213 

20 

313 21 



322 

15 

123 22 

223 

23 

323 24 



332 

18 

133 25 

233 

26 

333 27 






Rep. Ill 





(1) 


(2) 

(3) 


(4) 

(5) 


Code 

T 

Code T 

Code 

T 

Code T 

Code 

T 

111 

1 

211 2 

311 

3 

121 4 

221 

5 

112 

10 

212 11 

312 

12 

122 13 

222' 

14 

113 

19 

213 20 

313 

21 

123 22 

223 

23 

(b) 


(7) 

(8) 


(9) 



Code 

T 

Code T 

Code 

T 

Code T 



321 

0 

131 7 

231 

8 

331 9 



322 

15 

132 16 

232 

17 

332 IS 



323 

24 

133 25 

233 

26 

333 27 




* Treatment number. 

The original treatment numbers are shown as well as the code num- 
bers. In an experiment it is simplest to record only the code numbers, 
which are reciuired in order to follow the computing instructions. 

Ckibic lattices may be expected to be most useful when the number of 
treatments exceeds 100. The designs for k — 3, 4, 5, and 0 lie in the 
range which is also covered by latticie designs. We do not know of any 
investigations of the relative accuracy of lattice and cubic lattice designs 
in these cases.. With highly variable experimental material, the use of a 
smaller block might give the cubic lattice some advantage; on the other 
hand the statistical analy^sis is more laborious owing to the greater num- 
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her of block adjustments to be calculated. Like the lattice designs, 
cubic lattices cannot be appreciably less accurate than randomized 
blocks which occupy the same set of replications, and can be analyzed 
by the method for randomized blocks. 

Some cubic lattices can also be arranged in quasi-latin squares (10.11) 
so as to allow the elimination of two sources of error variation, 

10.42 Arrangement of Experimental Material. The most important 
rule is to have units in the same incomplete block as homogeneous as the 
experimental material permits; in field trials the blocks should be com- 
pact in shape. If practicable, blocks within the same replicate should 
also be similar to each other, so as to increase the accuracy of inter-block 
comparisons and to obtain more precise results if the analysis is sub- 
sequently carried out by the method for randomized blocks. 

10.43 Randomization. The steps are : 

1. Randomize the order of the blocks independently within each rep- 
lication, 

2. Randomize the positions of the treatment code numbers inde- 
pendently within each block. 

3. Assign treatments to code numbers at random. 

10.44 Statistical Analysis. The following reference should be con- 
sulted. 

(10.12) Yates, F. The recovery of inter-block information in variety 
trials arranged in thi'ee-dimensional lattices. Ann. Eugen. 9 y 
136-150, 1939. 

General formulae are given with a nmnerical example of the analysis for 
an experiment with 64 treatments. In following this example, the 
reader should note that the experiment was not arranged in separate 
replications; the changes introduced for this reason are pointed out by 
Yates (pp. 147-148). The method of analysis when 6 or 9 replicates are 
used is also indicated (p. 144). Reference (10.13) describes the analysis 
of an experiment with 125 treatments. 

10.45 Error Variances. The formulae for the error variance of the 
difference between two treatment means are summarized below. In the 
analysis as described by Yates (10.12), the symbols 

w -- w' 

2w + m' 


w w' 
w + 2w^ ^ ^ 
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play a prominent part. 

It is convenient to write X' == X/Vc^, = p/k^. 

The error variances for the different types of treatment comparisons are 

as follows: 



Comparison 

Variance of difference between means 

1^211 

~ hii 

— [1 + 2X' + 2(k - l)yu'l 

h22 

” hu 

2Be 

~fl -I- 4X' + (3/c - 4)yi 
r 

^^222 

— hn 

2E^ 

— ^ [1 + f)X' 4- 3(/c - 2 )m'| 
r 

Average 

2Eer 1 


Here Ee is the intra-block error m.s. and r the number of replicates 

(3, 6, or 9). The formulae, though written in a slightly different form, 

are identical with those given by Yates. Our h corresponds to Yates’s 

p, and our r to his n, 
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PLANS 


3 X B balanced lattic 


4 X 4 balanced lattice 


5X5 balanced lattice 


* The symbol X denotes the number of times that two treatments appear in the 
same block* 


T 

7 

13 

19 

25 

21 

2 

8 

14 

20 

16 

22 

3 

9 

15 

11 

17 

23 

4 

10 

6 

12 


24 

5 

Rep. VI 

1 

22 

18 

14 

10 


I 

2 

3 

”1 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

Rep. IV 

1 

14 


12 

li- 


11 

S 


1 

6 

11 

16 

5 

■ 2 

15 

12 

9 

14 

3 

S 

13^ 

10 

i 



T 

- 

9 

Is 

2 

6 

10 

14 

3 

7 

11 

15 

4 

8 

12 

16 


Rep 

V 


_ 

10 

15 


IT 

■2“ 

7 

16 


r 

2 

'"s 

4 

5 

6 

Y 

8 

9 


1 

5 

9 

7 

2 

6 

4 

8 

3 



Plan 104 


Block Rep. I 


(1) 

1 

2 

3 

4 

5 

6 

7 

(2) 

8 

9 

10 

11 

12 

13 

14 

(3) 

15 

16 

17 

18 

19 

20 

21 

(4) 

22 

23 

24 

25 

26 

27 

28 
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CHAPTER 11 


BALANCED INCOMPLETE BLOCKS 
li.l Description 

The balanced lattices described in the last chapter are a particular 
group of a general ckuss of designs known as balanced incomplete blocks. 
All these designs have the property that any pair of treatments appears 
together equally often within some block. Thus in plan 11.1 every pair 
of treatments appears together once in the same blockj in plan 11.2 twice, 
and in plan 11.5 four times. This property insures that the same stand- 
ard error may be used for comparing every pair of treatments; it also 
facilitates the statistical analysis, since any treatment totalis adjusted 
in a single operation for all the blocks in wdiich the treatment appears. 

The construction of the designs presents interesting mathematical 
problems, (11.1)) (11.2), (11.3). Although a design can be found for any 
number of treatments, i, and any size of block, /c, most of these are of no 
interest for our purpose, since they require too many replications. The 
plans in this chapter have been restricted to tliose in wdiich the number 
of replicates does not exceed 10. 

The balanced lattices are the particular set of designs for which the 
relation t = holds. They. also have the property that the blocks can 
be grouped in separate replications. The majority of the balanced in- 
complete block arrangements do not possess this property, which is pos- 
sible only when t is a multiple of k. A few designs can be sorted into 
groups w^hich comprize Iavo or three replicates each. 

In most plans, the block contains six or fewer imit>s. Accordingly, the 
designs are adapted for experiments in which the appropriate size of 
block is small They have been applied in greenhouse pot experiments, 
where the block is restricted to the width of the bench; in experiments 
on plant virus diseases, where the block consists of a small number of 
leaves on each plant; in experimental cookery where there is a limited 
number of stoves, in tests of mosquito repellents, reference (11.4), where 
the block consists of the two exposed arms of a subject; and in 
nutritional experiments where each child constitutes a block, 3 different 
foods being given during the 3 terms of the school year. 
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316 . BALANCED INCOMPLETE BLOCKS 11,1 

Examples from field experimental work occur in blueberry fertilizer 
trials wherej owing to the imeven growth of different bushes^ only a small 
number of them can be grouped into a homogeneous block. A similar 
consideration may limit the size of block in experiments on the control 
of fruit pests, where the tree constitutes the experimental unit. 

A few types of factorial design — ^the 5 X 2, 5 X 3, 7 X 3, and 7 X 4— 
can be arranged in balanced incomplete blocks with the result that main 
effects and interactions are confounded to the same extent. It happens 
that in these cases there is no suitable method of confounding the inter- 
actions alone. 

11,2 Comparisons with Other Designs 

Much experimental material is of the type exemplified above, in that 
the natural grouping for use as a block accommodates only a small num- 
ber of units. Where the number of treatments exceeds the number of 
units in a block, there are three alternatives to the use of a balanced 
incomplete block design. (1) A common treatment is placed in each 
block, so as to provide a basis for comparison between ti^eatments whicli 
are in different blocks, (2) The treatments are divided into groups such 
that the number of treatments per group is equal to the number of units 
per block. The groups are compared in a split-plot arrangement (chap- 
ter 7). (3) Complete replicates are assembled by the grouping of units 
from more than one block, in order that a randomized block design may 
be used. The merits of the first two alternatives were discussed in sec- 
tion 9.5. 

The relative efficiency of randomized blocks and balanced incomplete 
blocks has been discussed by Yates (11.5). Balanced incomplete block 
designs which are arranged in replications cannot be appreciably less 
accurate than randomized blocks. With balanced incomplete blocks 
which cannot be arranged in replications, a comparison with randomized 
blocks is not easy except by the use of uniformity data. Certain con- 
clusions can be drawn from the nature of the experimental material If 
there is some association among the members of different blocks, it may 
be possible to form replicates which are only slightly less homogeneous 
than the blocks. For example; suppose that in an animal experiment the 
litter is the natural block. If the responses of the animals are influenced 
by their genetic constitution and if replications are constructed from 
closely related litters, the experimental error may be only slightly greater 
within a replication than within a block. In this event randomized 
blocks may be the more accurate layout. 

Sometimes the investigator has no information about any associations 
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between the groups which constitute the blocks; he knows merely that 
the experimental responses are likely to differ from group to group. In 
these circumstances Yates (11,5) has shown that incomplete blocks (with 
the recovery of inter-bloclc information) are nK.>re accurate on the aver- 
age except when tlie blocks are ineffective, There is the additional ad- 
'\uintage that the blo(*k dilTerences can be measured as a guide to the de- 
sign of fiitui '0 experiments, wherc^as with replications composed of units 
from several blocks it would be diflicult to e.stimate the block effects. 

It is worih noting the eficicncy faeior (E) of any balanced incomplete 
block design whose use is under (ionsideration. This factor, whicli is shown 
as a decimal fraction in table 11.3 (p. 327) and in the plans, is a lower 
limit to the efficiency of balanced incornph'te blocks relative to ran- 
domized l^locks. With 37 treatments in tilocks of 9 units, for instarice, 
tlie efficiency factor is 91%, so that the loss of efficiency could not exceed 
9%. The loss equals 9% only in the rather unlikely situation in which 
the replications of 37 units are as uniform as the blocks of 9 units, yet the 
variation among blocks is large compared to that within blocks. With 
any reasonable prospect that the blocks are more homogeneous than 
the replicates, the incomplete block design is to be preferred in this 
case. On the other hand, with 7 treatments in blocks of 2 units, the 
efficienc 3 ^ factor is only 58%. In this case incomplete blocks will be 
more accurate onlj^' if there is a su]>stantial reduction in error variation 
because of the reduction in block size. 

11.3 Arrangement of Experimental Material 

Units within the same incomplete block shoidd be as homogeneous as 
possible. For thos<i designs which can also })e grouped in replications, it 
is advisable to place similar blocks, so far as these can be discerned, 
within the same replicate. 

Certain of these designs can be rearranged to form an incoxnplete latin 
square (chapter 13) in which variation associated with two types of 
grouping can be eliminated from the experimental errors. The experi- 
menter should consider wlietlier advantage can be taken of this double 
control so as to reduce the error. 

11.4 Randomizatioix 

The steps are: 

1. Rearrange the blocks at random. (If the design is arranged in 
complete replications, the blocks are randomized only wdthin each repli- 
cation and the replications are kept separate.) 


2. Randomize the positions of the treatment numbers within each 
block. 

3. Assign treatments at random to the numbers in the plan. 

11.5 Statistical Analysis 

In order to present the shortest computational methods, it is ad- 
visable to divide the plans into five types, which are sho^vn in the index 
at the end of this chapter. 

11.61 Type I. Designs Arranged in Replications. We will analyze 
an example with 6 treatments in blocks of 2 (plan 1 1 .3) . The experiment 
was conducted by Dr. Pauline Paul at Iowa State College (11.6). Its 
object was to compare the effects of length of cold storage on the tender- 
ness and flavor of beef roasts. Six periods of storage (0, 1, 2, 4, 9, and 
18 days) were tested: these are denoted by treatment symbols 1, 2, 3, 
* • ', 6, respectively. 

Thirty roasts from the round of an animal were used. Four muscles 
each provided 6 roasts, while 3 muscles each provided 2 roasts. The 
roasts on any muscle group themselves naturally into pairs, since to each 
roast on the left side of an animal there corresponds another roast on 
the right side. From previous experience it was believed that the 2 
roasts in any pair would give closely similar results. Vaiiation among 
different pairs from the same muscle was expected to be somewfiiat larger, 
and variation among muscles to be still larger. 

These opinions prompted the use of a design in blocks of 2, each block 
comprizing the left and right roasts in a pair. When grouping the blocks 
into replications, it was natural to put roasts from the same muscle into 
the same replicate. With the first 4 muscles a separate replicate could 
be made from each muscle. The remaining replication consisted of the 
3 smaller muscles. 

The plan and the scores for tenderness are given in table 11.1. Scoring 
was done by 4 judges, each marking on a scale from 0 to 10. The scores 
shown are their totals (out of 40), a high score indicating very tender 
beef. 

The following symbols are used: ^ — 6 = the number of treatments; 
^ = 2 ~ number of units per block; r = 6 === number of replications; 
6 ~ 15 ~ number of blocks. 

1. Find the block totals, the replicate totals, the grand total, G, 
and the treatment totals, T, placed below the plan. 
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TABLE 11.1 Scores for tenderness ox^ beef 



Rep. 

r 



Rep. 

II 



Rep. : 

111 






B 





B 





B 

(1) 

7 

(2) 

17 

24 

(1) 

17 

(3) 

27 

44 

(1) 

10 

(4) 

25 

35 

(3) 

26 

(4) 

25 

51 

(2) 

23 

(5) 

27 

50 

(2) 

26 , 

(6) 

37 

63 

(5) 

33 

(6) 

29 

62 

(4) 

29 

(6) 

30 

59 

(3) 

24’ 

(5) 

26 

50 





137 





153 





148 


Rep. 

IV 



Rep. 

v' 











B 





B 






(1) 

25 

(5) 

40 

65 

(1) 

11 

(6) 

27 

38 






(2) 

25 

(4) 

34 

59 

(2) 

24 

(3) 

21 

45 






(3) 

34 

(6) 

32 

66 

(4) 

26 

(5) 

32 

58 
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Storage 

time 

(days) 

Treat- 

ment 

no. 

T 

Bt 

Q 

w 

T -f acF 

Adj. 

means 

0 

1 

70 

206 

-66 

19 

71.8 

14.4 

1 

2 

115 

241 

-11 

24 

117.3 

23,5 

2 

3 

132 

256 

8 

17 

133.6 

26.7 

4 

4 

139 

262 

16 

15 

140.4 

28.1 

9 

5 

158 

285 

31 

-24 

155.7 

31.1 

18 

6 

155 

288 

■ ■ : 22 

-51 

m,2 

30.0 


G 

-769 

1538 

0 

0 

769.0 



2. For each treatment, calculate the following quantities: 

Bt — total of all blocks in which the treatment appears 

w {t /c)r (r- I)fie + (^ - 4T m+G 

The total of the Q^s ami the TF’s should each be zero, 

3. The analysis of variance is as follows: 



d.f. 


s.s. . 

ni.s. ■ 

Replications 

(r - 1) 

4 

298.5 


Treatments (unadj.) 

it - 1) 

5 

1059.8 


Blocks within reps, (adj.) 

ib - r) 

10 

213.4 

21.34^'?, 

Intra-block error 

— il — h + I) 

10 

77.3 

7,7ZBe 

Total 

{Ir - 1) 

29 

1640.0 
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■mmmm 

liBii 


The only sum of squares requii’ing special instructions is that for blocks 
within replications, adjusted for treatment effects. Since we do not 
know any simple way of finding this directly, it is obtained by a round- 
about method. Calculate in the usual way the unadjusted sum of squares 
for blocks within replications: this comes to 753.0. Then compute the 
siun of squares for treatments, adjusted for blocks, which is 

f) ^ ^ r/A?/?'\2 t /1 1 \2 I I /oo\2l crorv 


- ■■■: = [( 66 )^ + ( 11 )^ +•••-{- ( 22 ) 2 ] ^ 520.2 

kir(k-l) (2)(6)(5)(1) 

The adjusted sum of squares for blocks is then given by the following 
combination of sums of squares 


Blocks (iinadj.) + treatments (adj.) 

763.0 + 520.2 

4. Calculate the weighting factor 


treatments (imadj.) 
1059.8 


0.09484 


r(E, - E,) 

^ rt(k - l)Ei + k(b~r-t + 1)E', 

5(21.34 - 7.73) 

= L___ = 0.09484 

(30) (21. 34) + (10) (7.73) 

The adjusted treatment totals are 

T + 

and the means per unit are found on division by r. 

5. The effective error variance per unit is estimated as 

Ee[l + {t- A)m] = 7.73[1 + (4) (0.0948)] - 10.66 

We derive ^tests from this figure by the ordinary rules. For instance, 
th e standar d error of the difference between two adjusted totals is 
V'2r(10.66), or 10.32; for the difference between two adjusted means the 
standard error is V (2)(10.66)7r, or 2.06. 

6. If an F4est is desired, calculate the sum of squares of the adjusted 

treatment totals. ThivS figure, 4718.0, is divided by r{t — 1), or in this 
case 25, in order to find the mean square on a unit basis. The mean 
square, 188.7, is compared with the effective error m.s., 10.66, which is 
assigned {tr — ^ + 1), or 10, d.f. The F-ratio, 188.7/10.66, is highly 

significant. The test is not exact, because it ignores the effects of errors 
in the weighting factor }i. 

In an experiment of this type, interest centers on the shape of the 
response curve rather than oh comparisons between pairs of treatments. 
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The adjusted means indicate the common ^Tliminishing returns^*’ curve. 
An exponential of the form y — A — where t is the time of stor- 

age, fits the data satisfactorily. A parabolic regression on which is 
easier to compute, will be found to be inadequate. The conclusion is 
that storage up to about a week increases teiiderness. 

7. The gain in precision over randomized blocks is estimated as fol- 
lows. By pooling the sums of squares for blocks within replications and 
intra-block error, we obtain an estimate of the error ni.s, that would have 
applied with randomized blocks. This figure, 14.54, is compared with, 
the effective error ni.s., 10.66. The relative precision is estimated as 
14.54/10.66, or 136%. Since the randomized block design has 20 
error d.f , as against 10 for the design actually used, this estimate should 
be reduced to 126% by the adjustment described in section 2,31. 


11.52 Type 11. Designs Arranged in Groups of Replications. Only 
two of the plans are of this type, plans 11.6 and 11.13, and in each case 
the group consists of two replications. (Plan 11.2 is also in this class, but 
is better analyzed as shown in section 11.55.) The analysis is very sim- 
ilar to that in the previous section. Two changes should be noted. If 
c is the number of groups, so that c = r/2 for the plans presented here, 
the degrees of freedom in the analysis of variance subdivide as shown 
below. 



d.f. 

m.s. 

Groups 

(c- 1) 


Treatments (imadj.) 

(t - 1) 


Blocks witiiin groups (adj.) 

(6-c) 

Eu 

Intra-block error 

(tr~t-b + J) 

E, 


Total (tr - 1) 

The second (?hange is that the weighting factor fx l)eeomes 

(6 ^ c)(Eo - E,) 




tik ^ 1)(6 - + it - Jc)(Jb - c-t + l)Ee 


11.63 Type III. Designs Not Arranged in Replications or Groups of 
Replications. Here again the method of section 11.51 may be followed 
except that no replication effects appear. The analysis of variance now 
reads as follows. 

d.f. m.s. 

Treatments (unadj.) — 1) 

Blocks (adj.) (h - 1) Eb 

Intra-block error (tr — ^ — 6 + 1) Ee 
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I As before, the sum of squares for blocks (adjusted) is found by a sub- 

i: traction process. The weighting factor become^ 

^ {b ~ l)iEi - Ee) 

j. , ^ t{lc — 1)(Z) — l)Eb + (t — k){b — t)Ee 

1 Otherwise the method of section 11.51 applies. Of course, the gain in 

accuracy relative to randomized blocks cannot be estimated. A worked 
^ example with t - 9, k = 4, has been given by Yates (11.5, 11.7). His 

method of calculating the adjusted sum of squares for blocks is slightly 
1 ’’ different from ours, since he divides this sum of squares into two com- 

, i ponents so that the calculation becomes partially self-checking. Results 

i should be identical by either method. 



11.64 T 3 rpe IV. Experiments with t ~ 6. These designs, which can- 
not be arranged in replications, are covered by the computing instruc- 
tions given in the previous section. However, the analysis simplifies 
slightly when t = 6, because the sum of squares for blocks (adjusted) 
can be found directly. The data in table 1 1.2 are from an experiment on 
corn hybrids with i = 5 — 13, r — /c = 4. For field experiments a de- 
sign that can be laid out in separate replications is usiially preferred. 
In this experiment, conducted in 1943 by the North Carolina Agricultural 
Experiment Station, it was desired to compare 10 hybrids, all potentially 
suitable for commercial use, at a number of places throughout the state, 
and at each place to test in addition 3 standard varieties that were 
adapted to that place. With 13 treatments no design that can be put into 
replications is available, and since 4 was a convenient number of replica- 
tions, the design with ^ = 13, r = fc = 4 was chosen. Randomized 
blocks might have been equally good. Incidentally, the plan used 
is different from that given in plan 11.22, a different method of con- 
struction having been emploj^ed. Both plans have the same structural 
properties. 

1. Find the block totals, B, the treatment totals, 5P, and for each 
treatment find the total Bt of all blocks in which the treatment appears. 
Time is saved if T and Bt are found simultaneously. The Bt values should 
add to k times the grand total G. 

2. For each treatment find 

W (t-k)T it - 1)B, + (k - 1)G 

, == 9? -■ 12Bf + 3G (in this example) 

The W's should add to zero. 



TABLE 11.2 Plan and yields of corn (pounds per plot) 


Block 

Block totals 


1 

(3) 

25.3 

(6) 

19.9 

(9) 

29.0 

(11) 

24.6 

98. S 

2 

(8) 

23.0 

(4) 

19.8 

(8) 

33.3 

(12) 

22,7 ' 

98.8 

3 

(10) 

16.2 

(11) 

19.3 

(12) 

31.7 

(13) 

26,6 ■ 

93.8 

4 

(2) 

27.3 

(5) 

27.0 

(8) 

35.6 

(11) 

17.4 

107.3 

5 

(7) 

23.4 

(8) 

30,5 

(9) 

30.8 

(10) 

32.4 

117.1 

6 

(4) 

30.6 

(5) 

32.4 

(B) 

27.2 

(10) 

32.8 

123.0 

7 

(1) 

34.7 

(5) 

31,1 

(9) 

25.7 

(12) 

30.5 

122.0 

8 

(3) 

34.4 

(5) 

32.4 

(7) 

33.3 

(13) 

36.9 

137,0 

9 

(1) 

38.2 

(2) 

32.9 

(3) 

37.3 

(10) 

31.3 

139.7 

10 

(2) 

28.7 

(4) 

30.7 

(9) 

26.9 

(13) 

35.3 

121,6 

11 

(1) 

36,6 

(4) 

31.1 

(7) 

31.1 

(11) 

28.4 

127.2 

12 

(1) 

31.8 

(6) 

33.7 

(8) 

27.8 

(13) 

41.1 

134.4 

13 

(2) 

30.3 

(B) 

31.5 

(7) 

39.3 

(12) 

2(>.7 

127.8 


1548.5 


No. 

T 

Bt 


If 

T-hiiW 

1 

141.3 

523.3 

-- 

•362.4 

136.7 

2 

119.2 

496.4 

, -- 

•238.5 

116.2 

3 

120.0 

474.3 


33.9 

120.4 

4 

112.2 

470.6 


8.1 

112.3 

5 

122.9 

489.3 


-120.0 

121.4 

6 

112.3 

484.0 

- 

-151.8 

110.4 

7 

127.1 

509.1 

- 

-319,8 

123.0 

8 

127.2 

457,6 


290.1 

131,0 

9 

112.4 

459.5 


143.1 

114.2 

10 

112.7 

473.6 


- 23.4 

112.4 

11 

89.7 

427.1 


327.6 

93.9 

12 

111.6 

442.4 


341.1 

115.9 

13 

139.9 

486.8 


63.0 

140.7 


1548.5 

B194.0 


0 

154:8.5 

The anal 3 ’’sis of variance 

is: 






d.f. 


■ s.s. ■ 

m.s. 

Treatments (unadj.) 

(t - 1) 

12 

542.67 


Blocks (adj.) 


(6 - 1) 

12 

475,27 

39.61.^/, 

Intra-block error (tr 

— 2{ + 1) 

27 

538.21 

19.93£?e 

Total 


{tr - 1) 

51 

1556.15 



The sum of squares for blocjks (adjusted) is found directly as the sum 
of squares of the TF’s, divided by tr(t — /c)(/c — 1), or in this 
(4)(b3)(9)(3) = 1404. 


11.54 


TYPE IV. EXPERIMENTS WITH t = 6 
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4, Tlie weighting factor ju simplifies to 

19.68 

“ tik - l)Eb (13)(3)(39.61) 

The adjusted treatment totals are the quantities (T + iuTT). 

5. As before, the effective error m,s. per unit is taken as 

EAl + (t - /c)m] - 19.93[1 + (9)(0.0127)] - 22.2 
Other features of the analysis are the same as in previous sections. Yates 
(11,5) presents an example with t = 21, k = 6. 

11.66 Type V* Small Experiments. In certain of the smaller de- 
signs, the numbers of degrees of freedom in the mean squares for blocks 
and intra-block error are rather small Consequently, the estimates of 
the relative weights assigned to inter- and intra-block comparisons are 
poor. In these cases, unless the whole plan has been repeated to secure 
extra replication, it is best to use the original method of analysis that 
Yates developed (11.8). This method adjuvsts the treatment means for 
differences between the blocks, but makes no use of inter-block informa- 
tion. The analysis is a little simpler than when inter-block information 
is iised. ^ ■ . ■ ' ■ 

For purposes of illustration, this analysis will be applied to the ex- 
ample in the previous section, with t == b — L3, r ~ /c == 4, though in 
practice inter-block information should be utilized in an experiment of 
this size. The data are presented in table 11.2. 

1. Find the block totals, and the treatment totals, For each 
treatment find the total Bf over all blocks in which the treatment appears. 

2. For each treatment compute 

Q^kT^ Bt 

t-l 12 

f ^ m + Q = 771 + ;; p- Q - 771 + 0,0769Q 

trik ^ 1) (13)(4)(3) 

where = 1548,5/52 — 29.78 is the mean of the whole experiment. 
The values are the adjusted treatment means. The Q values should 
add to zero. 

3. The analysis of variance is shown below. 



d.f. 


s.s. 

m.s. 

Blocks (unadj.) 

(5 - 1) 

12 

689.38 


Treatments (adj.) 

(t-1) 

12 

328.55 

27.38 

Intra-block error 

{It -t-h + l) 

27 

638.22 

19.93F, 

Total 


'-51 

1556.15 



Note that the blocks sum of squares is imad justed. It is found in the 
usual w^ay as the sum of squares of deviations of the divided by k. 




11,56 


MISSING DATA 


This rule applies even wlieri the experiment m arranged in repli('ation,s 
or gi'oups of replications, though if tlie experimenter wishes, ho may di- 
vide the blocks sum of squares into that due to replications (oi- groups) 
and that due to blocks within replications. 

The adjusted sum of squares for treatments is 

i .1 12 

Z Q® = — ; [(-13 -9)“ + (i9.(j)^ -f i- (72.8)=*1 

rtfe(fc-l)^ (4.) (13) (4) (3) 

= 328.55 

As usual, the intra-block error s.s. is found by subti’action. 

4. For /-tests, the effective error va-riancf? is 


/ — /c 


- 


Ht - 1) 


(19.h3)(12)(4) 
03) (3) 


- 24J 


t(h - 1) 

For the variance of the difference between two adjusted i.reatment 
means, we multiply this quantity by 2/r. 

5. The analysis above provides an exact of the adjusted treat- 

ment means. The F-ratio is 27.38/19.93, with 12 and 27 d.f. 

11.56 Missing Bata. For reasons given in section 10.13, it is recom- 
mended that the analysis of incomplete data be carried out by inserting 
for the missing observations values which minimize the intra--bl(.)ck 
error s.s. In our notation the formula, first given by Cornish (11.9), is 
Hk - i)B + k{t - i)Q -- 0 - i)a' 

where B is the total of the block containing the missing observation, Q is 
the Q value (= kT — Bt) for the treatment that contains the missing 
value, and Q' is the sum of tl)e Q values for all treatments that are in the 
block with the missing value. 

Example 1, In the expeafiment on roast beef in section 1 1.51, 
that treatment (5) in replication IV is al >sent. The first step is to 
the block totals, tlie treatment totals, the Bt and the Q values as in the 
ordinary analysis. An x should be inserted for the missing ol>servation. 
The data read as follows. 


Treatment 

number 

1 
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11.56 


32() 

Since the block with the missing value contains treatments (1) and (5) 
Q' = -26 - 9 = -35 

Hence, since 5 is 25, 

305 + lOQ - 5Q' 6B + 2Q ~ Q' 

___________ 4“' 

(6) (25) + (2)(-26) - (-35) _ 

— _ — ^ ~ — 

4 

On substituting this value in all places where an x occurs in the table 
above, we are reacty to proceed with the analysis of variance. 

Example 2. For an example with 2 missing observations, suppose 
that treatments (8) and (13) are missing from block 12 in the corn ex- 
periment in section 11,54. Denote the values to be inserted by x and 
respectively. To save space, the ]:)reliminary data are presented only 
for the treatments that occur in the same bk^ck as tlie missing observa- 
tions; these are treatments (1), (G), (8), and (13). 

Treatniont 
number 
1 
(i 
S 
13 


131.1 - Q' 

B 65.5 -f 0? -f ?/ 


T 

141.3 

112.3 
99.4 + X 
98.8 4“ y 


Bt 

454.4 + X y 
415.1 + ir 4“ y 
3SS.7 T 4- 
417.9 + x + y 


Q - 4T - Bf. 
110.8 - X - y 
34.1 — X — y 
8.9 4- 3:r y 
-22,7 - a: -f 3?y 


In the 3 replications in which it appears, the average for treatment (13) 
is 33 : this will be chosen as the first approximation to y. Then for x \ve 

have'. ..'■■■ 

B - 65.5 + 33.0 - 98.5; Q = 8,9 33 - --24.1; Q' - 131.1 

Since ^ — 13, /c — r — 4, 

= (‘3)(98-5) + (4)(-24.1) - (131.1) ^ ^ 

: ■ ; ' - 27 : r" ‘ 

We now substitute x = 39.0 in the preliminaay data and obtain a 
second approximation to y. For this we have 

B = 65.5 + 39.0 = 104.5; Q = -22.7 - 39.0 = -61.7 
Q' = 131.1 
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Note that Q', which does not involve x or ?/, remains fixed throughout the 
calculation. The second approximation to t/ is 

(13^104.5) + (4)(~-61.7) - (131.1) 

V ■ 


27 


36.3 


The next 2 approximations will be found to be .r - 40.1, y = 36.7, 
which may be used as the values to be inserted. 

The rest of the analysis proceeds as usual; the intra-block error de- 
grees of freedom are reduced by 1 for each missing observation. In the 
calculation of standard errors for ^-tests of the adjusted treatnauit 
means, the method given in section 10.13 should be followed. 
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TABLE 11.3 

Index 

PLANS 



t 

k 

r . : " 

h 

xt 

E 

Plan 

Type 

4 

2 

3 

6 

1 

.67 

11,1 

V 


3 

3 

4 

2 

.89 

■ * 

V 

■5 ■ 

:2.. ■ 

4 

10 

1 

.62 

• 11.2 

V 


3 

6 

10 

3 

,83 

* 

HI 


4 

A , 

■5, 

3 

.94 

* 

V 

6 

■ O'' 

5 

15 

1 

.60 

11.3 

I 


3 

^ ■ 5' , 

10 

2 

.80 

11.4 

in 


3 

10 

20 

4 

.80 

11.5 

I 


4 

10 

15 

6 

.90 

11.6 

II 


5 

5 

6 

4 

.96 


V 

7 

2 

6 

21 

1 

.58 


HI 


3 

3 

7 

1 

.78 

11.7 

Y 


4 

4 

7 

2 

,88 

11.8 

¥ 


6 

6 

7 

5 

.07 
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TABLE 11.3 Index to Thms (Continued) 


t 

k 

r 

b 

xt 

E 

Plan 

Type 

8 

2 

7 

28 

1 

.57 

11.9 

I 


4 

7 

14 

3 

.86 

11.10 

I 


7 

7 

8 

6 

.98 

« 

V 

9 

' 2 

8 

36 

1 

.56 

% 

III 


4 

8 

18 

3 

.84 

11,11 

III 


6 

10 

18 

5 

.90 

11,12 

III 


6 

8 

12 

5 

.94 

11.13 

II 


8 

8 

9 

7 

.98 

* 

IV 

10 

2 

9 

45 

1 

.56 

11.14 

I 


3 

9 

30 

2 

.74 

11.16 

III 


4 

6 

15 

2 

.83 

11.16 

III 


5 

9 

18 

4 

.89 

11.17 

III 


6 

9 

15 

5 

.93 

11.18 

III 


9 

9 

10 

8 

.99 

* 

IV 

11 

2 

10 

55 

1 

.55 

* 

III 


5 

5 

11 

2 

.88 

11.19 

IV 


6 

6 

11 

3 

.92 

11.20 

IV 


10 

10 

11 

9 

.99 

* 

IV 

13 

3 

0 

26 

I 

.72 

11.21 

III 


4 

4 

13 

1 

.81 

11.22 

IV 


9 

9 

13 

6 

.96 

11.23 

IV 

16 

a 

7 

35 

1 

.71 

11.24 

I 


7 

7 

15 

3 

.92 

11.25 

IV 


8 

8 

15 

4 

.94 

11.26 

IV 

16 

6 

6 

16 

2 

.89 

11.27 

IV 


6 

9 

24 

3 

.89 

11.28 

III 


10 

10 . 

16 

6 

.96 

11.29 

IV 

19 

3 

9 

57 

1 

.70 

11.30 

III 


9 

9 

19 

4 

.94 

11.31 

IV 


10 

10 

19 

5 

.95 

11.32 

IV 

21 

3 

10 

70 

1 

.70 

11.33 

I 


■■ 6 

6 

21 

1 

.84 

11.34 

IV 


7 

10 

30 

3 

.90 

11,35 

III 

25 

4 

B 

50 

1 

.78 

11.36 

III 


9 

9 

25 

3 

.93 

11.37 

IV 

28 

4 

9 

63 

1 

.78 

11.38 

I 


7 

9 

36 

2 

,89 

11,39 

III 

31 

6 

6 

31 

1 

.86 

11.40 

IV 


10 

10 

31 

3 

.93 

11.41 

IV 

37 

9 

9 

37 

2 

.91 

11.42 

IV 

41 

5 

10 

82 

1 

.82 

11.43 

III 

67 

8 

8 

57 

1 

.89 

11.44 

IV 

73 

9 

9 

73 

1 

.90 

11.46 

IV 

91 

10 

10 

91 

1 

.91 

11.46 

IV 


* These plans are constructed by forming all possible combinations of the t numbers 
in groups of aim h The number of blocks b serves as a check that no group has been 
missed. 

t Number of times that two treatments appear together in the same block* 
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Plan 11. i i “ ‘ij /c “ 2, r “ «3, & = 6, X — L ^ “ 0.67, Typo Y 

Block Rep. I Rop. II Rep. HI 
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/. — 5, k 
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Heps. I 
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Plan il,H 
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1 = 6, k = 

= 3,r 
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Plan 11 M 


= 6,1 

c « 4, r 

= 10, 

b ^ 

15, 

X = 6,£' = 

= 0.90 

, Type II 
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Plan. 11.7 
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)• = 3, 

b = 
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r — 4, 
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0.88, 

Type V 
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Plan 11.9 
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= 8, 

& =2, ) 

^«7, 

h ^ 

28. 
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Type I 
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(28) 

: 4 

6 






Plan 11.10 


-8, 

fc =4, ? 

•«7, 

h - 

14, 

X = 3 

1, £1 = 

0.86, 

Type 1 




Block Rep. I 
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Plan 11.11 
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=-9 

,k 

= 4,i 

= 8,6 

= 18, X = 
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^ 0.84, Type III 
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(14) 
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9 

(16) 

4 
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(4) 

1 

3 
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7 
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9 

(16) 
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5 

“6 
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8 
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9 
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8 
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Plan 11.12 

1=9, 

h 

= 5, r 

= 10, b 

= IS, X 
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= 0.90, Type 111 
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1) 

Plan 11.13 
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Plan 11.14 t = 10, k = 2, r = 9, b = 45, X = 1, E = 0.56, Type I 
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Plan 11.15 i = 10, i: = 3, r = 9, 6 = 30, X >= 2, E = 0.74, Type III 
Block 
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PUn 11.16 « = 10, A: = 4, ?■ = 6, 6 = 15, X = 2, E = 0.83, Type III 

Block 



(1) 

1 

2 

3 

4 

(6) 

1 

6 

8 

10 

(11) 

3 

5 

9 

10 



(2) 

1 

2 

6 

"" 6" , 

(7) 

2 

3 

6 

9 

(12) 

3 

6 

7. 

10 



(3) 

1 

3 

7 

8 

(8) 

2 

4 

7 

10 

(13) 

3 

4 

5 

8 



(4) 

1 

4 

9 

10 

(9) 

2 

5 

8 

10 

(14) 

4 

5 

6 

7 




(5) 

1 

5 

7 

9 

(10) 

2 

7 

8 

9 

(15) 

4 

6 

8 

9 



Plan 11.17 
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Plan 11.19 i - 11, A; - 5, r « 5, 6 « 11, X - 2, JS? «= 0*88, Type IV 
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Plan 11,20 i -llyk ^ 6, r - 6, 5 ~ H, X 3, ~ 0.92, Type IV 

Block 


( 1 ) 

4 

6 

7 

9 

10 

11 

(7) 

r 

2 

4 

5 

6 10 

(2) 

1 


7 

8 

10 

11 

(8) 

¥ 

3 

5 

6 

7 ' 

11 

(3) 

r 

2 

6 

8 

9 

11 

(9) 

1 

3 

4 

6 

7 

8 

(4) 

1 

2 

3 

7 

9 

10 

(10) 

2 

4 

5 

7 

8 

9 

(5) 

2 

T" 

4' 

8 

10 


(11) 

3 “ 

5 

6 

8 

9 

10 

(6) 

1 ’ 

3 

“S" 

'5 

9 

11 









PhLii 11.21 i = 13, k = 3, r - 6, 6 ~ 26, \ ^ I, E ^ 0.72, Type III 
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Plan IIM « = 15, fc = 3, r = 7,7> = 
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CHAPTER 12 


LATTICE SQUARES 
12.1 Description 

12.11 Balanced Lattice Squares. The number of treatments must be 
an exact square. Within each replicate, the treatments are arranged 
on the plan in Q.kXk square. The method of grouping into rows and 
columns, which varies in successive replications, is such that the treat- 
ment means can be adjusted for differences among the rows and columns 
of each square. Thus, in addition to the elimination of differences among 
replicates from the experimental errors, the design permits a ^‘double 
controF^ within each replicatCj similar to that obtained in a latin square. 

The principle which governs the grouping into rows and columns ma.y 
be seen from the plans 12.1-12.8. With 9 treatments, for instance, treat- 
ment 1 occurs in the same row as treatments 2, 3, 6, and 8, and in the 
same column as treatments 4, 5, 7, and 9. Thus everj^ other treatment 
appears with treatment 1 either in the same row or in the same column. 
More generally, any pair of treatments occurs once in the same row or in 
the same column. 

This property holds for all plans having an odd number of treatments, 
i.e., for 9, 25, 49, 81, 121, and 169 treatments. The number of replicates, 
for treatments, is {k + l)/2. The standard error of the difference is 
not exactly the same for all pairs of treatments, though the variation in 
accuracy is small. Separate formulae are given for the two standard 
errors for cases in which they may be needed. 

A design with twice the basic number of replicates (e.g., 25 treatments 
in 6 replicates) is obtained by repeating the plan, the roios being mtcr- 
changed with the cohmns. In designs so produced, every pair of treat- 
ments occurs together once in the same row and once in the same column. 
On account of this property, the standard error of the difference is the 
same for all pairs of treatments, whether the row and column differences 
are large or small. 

Within the most useful range, the only even numbers of treatments 
which provide lattice square designs are 16 and 64. These designs have, 
respectively, 5 and 9, i.e, (k + 1) replicates. Every pair of treatments 
occurs once both in the same row and in the same column. 

The available selection of designs (up to 12 replicates) is summarized 
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in table 12,1. By analogy with balanced lattices, the designs may be 
called 6aZancec? lattice squares. 

TABLE 12.1 Available numbers of treatments and replications for 

BALANCED LATTICE SQUARES 

Number of treatments 9 16 25 49 64 81 121 169 

Number of replicates 4,8 5,10 3,6 4,8 9 5,10 6,12 7 

For 16 treatments in 10 replicates, the basic plan is used twice. Al- 
though the basic plan (plan 12.1) requires only 2 replicates, the design 
for 9 treatments is not recommended with less than 4 replicates. Even 
in this case there are only 8 d.f. for estimating row and column variances 
and only 8 d.f. for error. Little would be gained over randomized blocks 
unless the extra controls were highly effective. A design with 9 treat- 
ments and 8 replicates is found by repeating the design for 9 treatments 
and 4 replicates. 

Certain factorial experiments can be arranged in halanced lattice 
squares; e.g., 3 X 3,8 X 2, 4 X 4, 4 X 2 X 2, 2^^, 5 X 5, 7 X 7,8 X 8, 
4^, 2^, 9X9, 3'h All main effects and interactions are confounded to the 
same extent with rows and cohmins. 

12.12 Partially Balanced Squares. For a balanced design, the availa- 
ble numbers of replications are rather severely restricted. Although de- 
signs are possible for other numbers of replicates, they lack the symmetry 
of the designs described above, with the consequence that the statistical 
analysis usually becomes more complicated. It happens, however, that 
if the number of replications is less than that required for a balanced de- 
sign, the statistical analysis follows the same procedure as for balanced 
designs, apart from minor changes in some formulae. In experimenta- 
tion where double grouping has proved effective but where the replica- 
tions are not sufficient for a balanced lattice square, the additional de- 
signs shown in table 12.2 may be useful. 

TABLE 12.2 Other lattice square designs 

Number of treatments 49 64 81. 121 169 

Number of replicates 3 3, 4 3, 4 3, 4, 5 3, 4, 6, 6 

When the number of treatments is odd, the plans for these designs are 
obtained simply by taking the desired number nf replicates from the 
plans for the balanced designs. Thus, for 49 treatments in threefold 
replication, we use squares I to III from plan 12.4. With 64 treatments, 
squares I, III, and V are used for 3 replicates, and squares I, III, V, and 
VII for 4 replicates. 

Partially balanced lattice squares are analogous to the triple, quad- 
ruple, and quintuple lattices. 
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12.13 Arrangement of Experimental Material. The method of ar- 
ranging tlxe experimental material is similar to that for an ordinaiy latin 
square. With treatments, the units are arranged in k X k squares so 
that the rows and columns of each square corre*spond to the two types of 
variation whose effects we wish to eliminate from the errors. In field ex- 
periments, the plots of each replicate ai’e usually laid out in square forma- 
tion, in which case row and column differences represent fertility?' vari- 
ations in two directions at right angles to each other. If the width of a 
greenhouse bench accommodates 5 pots, 25 treatments can be laid out in 
replicates which consist of 6 rows of 5 pots each. 

12.14 Randomization. Within each replicate the row^s and columns 
of the basic plan should be permuted separately at random before appty- 
ing the treatments. It is also advisable to assign the treatments at ran- 
dom to the treatment numbers in the plan. 


12.2 Statistical Analysis 

12.21 Designs with {k + l)/2, or Fewer, Replications. The same in- 
structions apply to the partially balanced designs and to the balanced 
designs with (A + l)/2 replications. Table 12.3 shows the plan and 
yields for a lattice square with 25 corn hybrids in 3 replications, con- 
ducted by the North Carolina Experiment Station in 1942^ Each plot 
contained 4 rows, vnth 10 hills per row. 

The computations proceed as follows. The number of treatments is 
A^, and the number of replications r. 

1. Find the row, coliunn, treatment, replication, and grand totals. 
The treatment totals are placed in a square under the plan. 

2. For each row, calculate the L value, where 

L — total (from all replications) of all treatments included in the 
row — r times row total 

Li - 89.8 + 93.3 + 82.1 + 88.9 + 91.1 -- 3(159.1) - -~32.1 
As a check, the total of the L values over a replication equals 
(Grand total) — r(repIication total) 

Thus, 

--76.2 - 2189.4 -- 3(755.2) 

Similarly for each column we obtain M values. 

M = total (from all replications) of all treatments included in the 
column — r times coliunn total 



TABLE 12.3 Yields (pounds corn per plot) for a 5 X 5 lattice sqttark in 

3 replications 


Treatment numbers are shown in parentheses 




S(iuare I 



Total 

L 

5 

(18) 

33.3 

(9) 30.7 

(1,1) 35.4 

(2) 30.1 

(25) 29.6 

159,1 

-32.1 

-2.58 

(24) 

24.6 

(15) 30.8 

(17) 28.8 

(S) 34.8 

(1) 32.5 

151.5 

-10.5 

-0.84 

(12) 

28.5 

(3) 24.0 

(10) 28.4 

(21) 25.0 

(19) 35.1 

141.0 

+22.4 

+ 1.80 

(6) 

26.7 

(22) 27.2 

(4) 25.6 

(20) 25.0 

(13) 29.4 

133,9 

+11.0 

+0.S8 

(5) 

40.1 

(16) 35.7 

(23) 30.1 

iU) 30.3 

(7) 33.5 

169.7 

-67.0 

-5.39 

Total 153.2 

148.4 

148.3 

145.2 

160.1 

755.2 


-6.13 

M 

-16.0 

+3.8 

-12.8 

-10.0 

-41.2 


-76.2 


« 

- 0.83 

+0.20 

- 0.66 

- 0.52 

- 2.13 

-3.94 





Square II 






(20) 

30.9 

(17) 33.3 

(19) 38.8 

(16) 27.7 

(18) 34.4 

165.1 

- 45.9 

- 3.69 

(15) 

37.2 

(12) 31.2 

(14) 27.9 

(11) 27.3 

(13) 21.6 

145.2 

- 5.8 

- 0.47 

(25) 

32.7 

(22) 43.0 

(24) 28.5 

(21) 24.7 

(23) 22.7 

151.6 

- 46.2 

- 3.71 

(5) 

32.0 

(2) 32.8 

(4) 31.8 

(1) 28.7 

(3)32.3 ^ 

157.6 

- 33.8 

- 2,72 

(10) 

39.8 

(7) 37.3 

(9) 31.9 

(6) 34.0 

(8) 34,3 

177.3 

- 69.3 

- 5.57 

Total 172.6 

177.6 

158.9 

142,4 

145.3 

796.8 


-16.16 

M 

-62.2 

00 

1 

-30.1 

-12.6 

-10.6 


-201.0 


€ 

- 3.22 

- 4.43 

- 1.56 

- 0.65 

- 0.55 

-10.41 




Square III 


(19) 

28.7 (15) 26.3 

(23) 

21.7 

(6) 21.9 

(2) 

26,0 

124.6 

+ 69.1 

+ 

5.56 

(11) 

19.4 

(7) 17.3 

(20) 

16.9 

(3) 22.6 

(24) 

34.2 

100.4 

+ 98.0 

+ 

7.88 

(22) 

18.3 (18) 22.1 

(1) 

17.5 (14) 25.0 

(10) 

36.9 

109.8 

+105.9 

+ 

8,51 

(5) 

30.2 (21) 27.5 

(9) 

30.7 (17) 28.1 

(13) 

37.6 

, 144.1 

+ 9.3 

+ 

0.75 

(8) 

34.4,', 

(4) 32.8 

(12) 

31.9 (25) 28.8 

(16) 

i0.C> 

158.5 

- 5.1 


0.41 

Total 131.0 

126.0 

118.7 

126.4 

135.3 

637.4 


+22.29 

M +86.0 

+61.6 

+54.8 

+46.8 

+28.0 


+277.2 



« + 

■ 4.45 

+ 3.19 + 2.84 

+ 2.42 

+ 

1.45 

+14.35 









Grand total 

2189.4 

0 


0 





Treatment 

totals 






(1) 

78.7 

(2) 

88.9 

(3) 

78.9 

(4) 

90.2 

(5) 

102.3 



(6) 

82.6 

(7) 

88.1 

(8) 

103.5 

(9) 

93.3 

(10) 

95.1 



(11) 

82.1 

(12) 

91.0 

(13) 

78,6 

(14) 

83.2 

(15) 

94.3 



(16) 

94.0 

(17) 

90.2 

(18) 

89.8 

(19) 

102.6 

(20) 

72.8 



(21) 

77.2 

(22) 

88.5 

(23) 

74.5 

(24) 

77.3 

(26) 

91.1 

2189.4 





Adjusted treatment totals 





(1) 

83.7 

(2) 

85.7 

(3) 

87.9 

(4) 

88.9 

(5) 

95.3 



(6) 

84,4 

(7) 

816 

(8) 

lOO.l 

(9) 

87.4 

(10) 

97.4 



(11) 

90.1 

(12) 

90.1 

(13) 

78.5 

(14) 

86.2 

(16) 

98.7 



. (18) 

85.5 

(17) ■, 

83.8 

^ (18) 

93,8 

(19) 

107.0 

(20) 

77,0 



(21) 

78.1 

(22) 

94.4 

(23) 

72.6 

(24) 

79.7 

(25) 

81.5 
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3. We ttow compute the. analysis of variance. The total s.s. and the 
sums of squares for replications and treatments are found in the usual 
way. If the symbol Lr denotes a replication total of the L’s, the stun of 
squares for rows within replications, adjusted for treatments, is 

_ E^ 

1) /A(r - 1) 

(32.1)2 ^ (10^5)2 ^ (5.1)2 (76.2)2 (201.0)2 (277.2)2 

“ ^ " 150 

= 1405.95 - 820.32 = 585.63 
The siun of squares for columns, eliminating treatments, is 
(16.0)2 ^ (3.8)2 + . . . + (28.0)2 (76.2)2 + (201.0)2 + (277.2)2 

lb ^ ~ " ~ 150 

= 1058.53 - 820.32 = 238.21 
The complete analysis of variance is shown in table 12.4. 

TABLE 12.4 Analysis of vaiuance for a 5 X 5 lattice square 



d.f. 


s.s. 

m.s. 

Eeplications 

(r-1) 

2 

546.88 

273.44 

Treatments 

(lr - 1) 

24 

611.00 

25.46 

Eows (adj.) 

rik - 1) 

12 

585.63 

4S,S0Er 

Columns (adj.) 

rik - 1) 

12 

238.21 


Error 

(k - l)(rk -r - k-1) 

24 

229.70 

9,o7Ee 

Total 

(rk^ - 1) 

74 

2211.60 



4* This step leads to the adjusted treatment means. Let 

Er == mean square for rows 

Ec ~ mean square for columns 

Ee == mean square for error 

, (Er - E,) 48.80 9.57 

X' 0.0804 

k(r ~ l)Er (5)(2)(48.80) 

, Ee) __ 19.85 -- 9.57 _ ^ 

^ ~ Hr- 1)E, ~ (5)^19.85) “ 

If ErOvEfi is less than then X' or ju', as the case may be, is taken as 0- 
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5. Multiply the Us hy X' to give S values, and the M’s by m' to give 
£ values, as presented in table 12.3. 

6. The adjusted total for any treatment is secured by adding to the 
unadjusted total the 5 and e values for each row and column in which 
the treatment appears. For treatment (1), we have 

78.7 + (-0.84) + (-2.13) + (-2.72) + 

(-0.65) + (8.51) + (2.84) = 83.7 

To obtain the treatment means (not shown) we divide by the number 
of replicates. 

7. The average variance oi the difference between two adjusted treat- 
ment means is 


r 


1 + 


rh 


(fc + 1) 


(X' -f \i!) 


2(9.57) 


(3)(5) 

1 + (0.1322) 


= 8.49; s.e. 


6 
2.91 


Except perhaps with the 5 X 5 and smaller squares, it is sufficiently ac- 
curate to use this figure for comparisons between any pair of treatments. 
More accurately, for two treatments that appear in the same row, the 
variance of the difference is 


2Ee 

— [1 -f (r - 1)X' -f V] = 8.40; s.e. = 2.90 

r - , 


For two treatments in the same column, 

2E/ 


[1 -h r\' + ir- 1)m'] = 8.58; s.e. = 2.93 


Clearly the average error is good enough in this experiment. For the 
partially balanced designs one more formula is needed, since some pairs 
of treatments do not occur together either in a row or in a column. The 
vaiianee of the difference is 

™[l-hKX'+M')] 

■ r '■ 

8. As is typical of these designs, the analysis does not provide an exact 
F~test, The treatments m.s. in the analysis (table 12.4) cannot be com- 
pared with the error m.s., since the former has not been adjusted for row 
and column effects. For an approximate F-test, compute the sum of 
squares of deviations of the adjusted treatment totals in table 12.3. This 
comes to 1600,05. Division by 3, since there are 3 replicates, gives 


m 
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535,36, with a mean square of 22.31. This may be compared with the 
effective error m.s., which is 

I'Jc 15 

Ee 1+ — (X' + /) =9.57 1 + ^(0.1322) = 12.73 
L fc + 1 J L b J 

The F-vatio is 22.31/12.73, or 1.75, with 24 and 24 d.f. 

9. The gain in precision relative to randomized blocks is estimated as 
follows. If the experiment were analyzed as randomized blocks, rows 
and columns would be amalgamated with error, giving 



d.f. 

s.s. 

m.s. 

Replications 

2 

546. 8S 


Treatments 

24 

611.09 


Error 

48 

1053.63 

21.95 


This error, 21.95, is compared with the effective error m.s., 12.73. The 
relative information is estimated as 21.95/12.73, or 172%. Thus 3 
replicates of the lattice square appear about as precise as 5 with ran- 
domized blocks. 

Often a comparison with a lattice design will be of more mterest. This 
experiment could have been planned as a triple lattice, and in that event 
the rows would probably have been chosen as blocks. The intra-block 
error would be derived from the pooled smn of squares for columns and 
error in table 12.4. The mean square would be 13.00, with 36 d.f. Con- 
sequently, for the triple lattice we have = 48.80 (m.s. for rows), Ee = 
13.00. The effective error m.s. (section 10.28) is 


Fe \ 1 + 


13.00 1 + 


r rfeM 1 r riEb - Be) ' 

L fc + iJ .. (fc + l)(r - 1)eJ 

r 3(48.80 - 13,00)1 

= 13.00 1 + ~77— ^ = 15.38 

L (6)(2)(48.80) J 

The relative accuracy of the lattice square to the triple lattice is 
estimated as 15.38/12.73, or 121%. 

An account of the theory and a worked example are given by Yates 
(12.1) for balanced squares and by Cochran (12.2) for partially balanced 
squares. 

12.22 Designs with (k + 1) Replications, In this case the analysis is 
slightly different, since it is possible to apply a single adjustment for all 
the rows in which a treatment lies, instead of making a separate adjust- 
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TABLE 12.6 Pebcent ages op squares attacked by bold weevils for a 
4 X 4 LAmCE SQUARE IN (fe +1) REPLICATIONS 


Boll weevil infestation 


Square I 


Row 

totals 


(10) 

9.0 

(12) 

20.3 

(9) 

17.7 

(11) 

26,3 

73.3 

(2) 

4.7 

(4) 

9.0 

(1) 

7.3 

(3) 

8.3 

29.3 

(14) 

9.0 

(16) 

6.7 

(13) 

11.7 

(15) 

4.3 

31.7 

(fi) 

4.0 

(S) 

6.0 

(5) 

5.7 

(7) 

14.3 

29.0 

Column totals 

26.7 


41.0 


42.4 


53.2 

163.3 




Square II 





(5) 

19.0 

(12) 

8.7 

(16) 

13.0 

(2) 

15.7 

56.4 

(10) 

12,0 

(7) 

6.0 

(4) 

15.3 

(13) 

12.0 

45.3 

(16) 

12.7 

(1) 

6.3 

(6) 

1.7 

(11) 

13.0 

33.7 

(3) 

3.7 

(14) 

3.7 

(0) 

8.0 

(8) 

13.3 

28.7 

Column totals 

47,4 


24.7 


38.0 


54.0 

164.1 




Square III 





(10) 

17.0 

(15) 

7.0 

(8) 

10.3 

(1) 

1.3 

35.6 

(ft) 

11.3 

(16) 

12.3 

(7) 

3,0 

(2) 

5.3 

31.9 

(12) 

12.3 

(13) 

8.7 

(6) 

8,0 

(3) 

9.3 

38.3 

(11) 

30.3 

(14) 

22.3 

(5) 

ILO 

(4) 

12.7 

76.3 

Column totals 

70.9 


60.3 


32.3 


28.6 

182.1 


Square IV 


(16) 

5.0 

(12) 

10.3 

(11) 

2.7 

(15) 

6.7 

(1) 

1.0 

(5) 

10.3 

(0) 

11.0 

(2) 

19.0 

Column totals 

19.7 


46.3 


(8) 

5.7 

(4) 

12.7 

33.7 

(3) 

10.3 

(7) 


25.4 

(0) 

11.3 

(13) 

; 11.7 

34.3 

(14) 

20.7 

(10) 


80,4 


48.0 


59.8 

173.8 


Square V 


(3) 

2.0 

(16) 

5.0 

(6) 

9.3 

(9) 

1.7 

(12) 

16.7 

(7) 

4.3 

(18) 

16.7 

(2) 

30.0 

Column totals 

44.7 


41.0 


(5) 

4,0 

(10) 

13,7 

24.7 

(4) 

6.3 

(15) 

12.3 

29,6 

(14) 

18.7 

(1) 

8.7 ' 

48.4 

(11) 

25.7 

(8) 

14.0 

86.4 


54.7 


48.7 

189.1 
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ment for eveiy row. On the whole, however, the analysis is more com- 
plex than wnth (fc + l)/2 replications. 

The example is a 4 X 4 lattice square conducted by the U. S. Bureau 
of Entomology and Plant Quarantine at Florence, S. C., and described 
by Wadley (12.3). The treatments were 16 arsenical insecticides applied 
to cotton with a hand dusting machine. Plots were 10 rows wide and 
70 feet long, being about To allow for border effects, records 

were taken only from the 4 center rows. The data in table 12.5 are the 
percentages of squares showing attack by boll weevils. These figures 
were obtained by examining 100 squares per plot, 25 from each of the 
4 center rows. Such counts were made at intervals during the summer: 
the data are averages from 3 counts made in August. 

The simplest computational routine, devised by Yates (12.1), will he 
presented here, though it is not too well adapted for making clear the 
meaning of the various steps. Yateses paper should be consulted for an 
account of the theory. 


TABLE 12.6 Oeiginal and adjusted teeatment totals 



Treat- 

ment 








Adj. 

treat- 

ment 

Adj. 


T 

Et 

Ct 

D 

u 




totals 

means 

1 

24.6 

181.3 

164.1 

. 17.2 

64.3 

81.5 

133.1 

150.3 

32.24 

6.45 

2 

74.7 

284.4 

196.6 

S7.8 

-250.8 

-163.0 

100.4 

188.2 

68.41 

13.68 

3 

33.6 

146.4 

221.9 

-75.5 

274.8 

199.3 

.27.2 

-102.7 

43.64 

8.73 

4 

56.0 

214.2 

222.1 

- 7.9 

25.4 

17.5 

- 6.2 

- 14.1 

56.79 

11.36 

5 

50,0 

220.7 

223.1 

■ — ■2.4 

- 31,1 

- 33.5 

- 40.7 

- 43.1 

47.20 

9.44 

6 

34.0 

211.0 

161.4 

40.6 

- 46.6 

3.0 

151.8 

201.4 

37.80 

7.58 

7 

33,3 

180.0 

211.0 

-31.0 

105.6 

74.6 

- 18.4 

- 49.4 

36.85 

7,37 

8 

48.3 

213.4 

224.0 

-10.6 

- 1.4 

- 12.0 

- 43.8 

- 54.4 

46,58 

9.32 

0 

60.0 

197.8 

240.3 

-42.5 

83.4 

40.9 

- 86.6 

- 129.1 

50.07 

10.01 

10 

81,4 

2.59.3 

253,5 

5.8 

- 98.5 

- 92.7 

- 75.3 

- 69.5 

74.57 

14.91 

11 

98.0 

295.1 

252.5 

42.6 

-211.1 

-16S.5 

- 40.7 

1.9 

87.95 

17.59 

12 

68.3 

250.1 

227.6 

22.5 

-104.9 

- 82.4 

- 14.9 

7.6 

03.51 

12.70 

13 

60.8 

236.0 

251.2 

-1.5.2 

- 64.4 

- 79.6 

-125.2 

-140.4 

63.45 

10.09 

14 

74.4 

265.5 

204.4 

61,1 

-157.5 

- 96.4 

86.9 

148.0 

71.36 

14.27 

15 

43.3 

178.7 

230.5 

-67.8 

152.1 

94,3 

- 79.1 

-136.9 

46.42 

9.28 

10 

41.7 

156.7 

199.4 

-43.7 

260.7 

217.0 

85.9 

42.2 

55.46 

11.09 

Totals 

872.4 

3489.6 

3489.6 

0 

0 

0 

0 

0 

872.30 

174.47 


1. Find the row, column, replication, and grand totals, all showm in 
table 12,5. 

2. Find the treatment totals T (table 12.6). For each treatment find 

Rt — total of all rows in which the treatment appears 
(7« == total of all columns in which the treatment appears 

* A ^^square^' is the name given to the young flower bud. 
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It ma3^ save time to find all 3 quantities, T, and simiiltaneoiis 
addition, since this means that the positions of a treatment in the dif- 
ferent replicates are found onty once. This can be done if the machine 
carriage is moved so as to accommodate 3 sets of running totals. The 
sum of the Rt values and the sum of the Gi values should each equal k 
times the simi of the T values. 

Thereafter the successive columns in table 12.6 are filled out as follows. 
D^Rt^Gt ' 

U ^ kT -- {k + l)Rt + G 

/-D + jy 

a ;=/ + (/c - i),d 

M[ = D + K 

Note that the total of each of these quantities is zero. 

3. Compute the analysis of variance. The only sums of squares re- 
quiring special instructions are those for roAVs and columns. After ad- 
justment for treatment effects, it happens unfortunately that the sums 
of squares for rows and columns are not mutually orthogonal. By a 
well-known result in the analysis of variance, their combined sum of 
squares may be expressed in either of two ways: 

Rows (adj. for treatments) + columns (adj. for treatments and rows) 
= columns (adj. for treatments) + rows (adj. for treatments and 
columns) 

Both expressions are computed. 

Sum of squares for rows adjusted for treatments 

SiU'f _ ( 64 . 3)2 + ( 250 . 8)2 . . . q, ( 260 . 7)2 

“ k\k + 1) 320 

Sum of squares for rows adjusted for treatments and columns 

S(J'^) (81.6)2 (163.0)2 + . . H- (217.0)2 


kHk - 1) 

Sum of squart's for colujnns adjusted for treatments 


192 

S{M'f 


kHk d- 1) 

Bum of squares for columns adjusted for treatments and rows 


S{K^) 
k\k - 1) 


1093.02 


- 1026.76 


625.86 


- 659.59 
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Note that 1093.02 + 559.59 = 1026.76 + 625.85. 

TABLE 12.7 Analysis of vabiancb 



d.f. 


s.s. 

m.s. 

E,epIieations 

h 

4 

31.56 

7.89 

Treatments 

- 1 

15 

1244.20 

82.95 

Hows (adj. for treatments) 


15 

1093.02 


Hows (adj. for tr, and col.) 

1 

15 

1026.76 

mABEr 

Columns (adj. for tr.) 


15 

625.85 


Columns (adj. for tr. and rows) 

kl- 1 

15 

559.59 


Error 

(k^ - l)(7c -2) 

30 

680.17 

22A7Ee 

Total 

+ k^ ~l 

79 

3608.54 



In finding the error s.s. by subtraction, note that we subtract the com- 
bined effect of rows and columns onZy once. 

4. This leads to the adjusted treatment totals. Compute 

q = ^ErE, - E,^ = (16)(68.45)(37.31) - (22.67)2 = 40,348 


e = (fc - l)g = 121,044 


,, (Er ~ 

Ee)(kEc Ef.) 

(45.78) (126.67) 


a 

121,044 

, iEc~ 

E,)ihEr - Bo) 

(14.64) (251, 13) 

ix' = 


: 121,044 


0.04787 

0.03037 


The adjuvsted treatment totals are 

T ■+• X'L' + n'M' 

and are inserted on the right of the M' column m table 12,6. For the 
means, divide by t. 

5. The error variance of the difference between two adjusted means is 


• [1 + /c(X' + ,10] 


2(22.67) 


[1 + 4(0.07824)] = 11.9 


To obtain an approximate F-test of the treatments, compute the sum 
of squares of deviations of the adjusted treatment totals, and divide by 
(k + 1) (fc2 — 1) to give the mean square on a single unit basis. This is 
tested against the effective error m.s., Ee[l + k(\' + mO]- 
The gain in precision relative to a randomized blocks or lattice design 
is estimated by the procedure for the lattice square with Qt + l)/2 
replications, , , 



12.24 


MISSING BATA 


357 


12.23 Designs with 2{k + 1) Replications. This ease is unlikely to 
be of much jiractical interest except possibly with the 3 X 3 design in 
8 replicates and the 4 X 4 in 10. The analysis is similar to that for (fc + 1) 
replications; the changes will be noted very briefly. 

The partition of degrees of freedom is as follows. 


Replications 

d.f. 

{2k + 1) 
- 1) 

Treatments 

Rows 

Component (a) 

- 1) 

Component (5) 

{k- - 1) 

Columns 

Component (a) 

ih^ - 1) 

Component (h) 

{k^ - 1) 

Error 

{2k - 3)(i-2 - 


Total -- 1 

Corresponding to any row, there is another row with the same set of 
treatments. Find the differences 1)^ between the totals of corresponding 
rows. The sum of squares of deviations of the Dr from their replication 
means, divided by 2/t;, gives component (a) of the rows s.s. Component 
(/>) corresponds to the component that was obtained with {k + 1) rep- 
licates. As before, it has two forms, an V and a J form. Both are found 
in the same way as with (fc + 1) replications, except that the divisors 
must be doubled. The sums of squares for columns are derived similarly. 

The formulae for the adjustment factors are a little more complicated. 
Let Ef be the mean square found by pooling component (a) and the J 
component of the rows s.s., with an analogous definition for Sc. Cal- 
culate 

w 1 ^ 1) 

" E,’ " 2kEr-E/ 2kEc-Ee 

^ ~ k[Wr + Wc + {k- l)Wi] ’ ^ k[Wr + Wo - {k - 

The remainder of the analysis proceeds without change. 

12.24 Missing Data, The procedure for analyzing lattice squares 
w^hen certain observations are missing has been worked out by Cornish 
(12.4). As in the case, of lattices (section 10.13), the method requires 
two different estimates of each missing observation, one being that found 
by minimizing the intra-row-and-column error, the other beiiig that ap- 
propriate to the case where the data are analyzed as a randomised block 
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experiment. In order to reduce the amount of computation, we present 
a cruder method which should be satisfactory if only .a small fraction of 
the total observations is missing. 

In this approach we estimate each missing observation by the first 
method mentioned above, that is, by minimizing the intra-row-and- 
column error s.s. Thereafter the analysis proceeds as usual: 1 d.f. is 
deleted from the total and intra-row-and-coliimn error s.s. for each miss- 
ing value. Special rules for ^tests are given later. 

The following totals are used in the formulae for inserting estimates 
in place of the missing values. R, C, T, and P are the totals of the row, 
column, treatment, and replication, respectively, that contain the miss- 
ing observation, while G is the grand total. Further, 

Sx = total of all othei* rows and columns in which the treatment with 
the missing value appears (note that this sum does not in- 
clude the row and column which contain the missing value) 

Tx = total (from all replicates) of all other treatments which appear 
in the row or column that contains the missing value (this 
total does not include the treatment that has the missing 
value) 

Since it is important that the definitions of end be correctly 
understood, it is worth working through the numerical example given 
below. 

Experiment with (k + 1)/^^ replications 

_ k{k - 1){R + C) - (fc + 3)P + 21c(k - 2)T + 6G- 2kSx - 2kT^. 

. ; (fc" - 1).^(*— ' 3) , " 

Experiment with (k + 1) replications 

__ k{k - 1){R + C) - (k + 1)P + k(k - 1)T + 3G- kS^ - kT^ 

Experiment with less than {k + 1)/B replications. Because of the lack 
of symmetry in this design, an additional symbol is needed. 

Ux — total (from all replicates) of all treatments that appear in the 
same row or column as the treatment that has the missing 
value. 

The distinction between and Tx should be realized. a total 

over only those other treatments that are in the actual row or column 
that contains the missing value, while Ux is a total over all other treat- 
ments that appear somewhere in the experiment in the same row or 
column as the treatment that has the missing value. Thus is a part 
of It may help to note that is a total over 2{k — 1) treatments, 
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ox 2r{k - 1) observations, where r is the number of replications, while 
Vx is a total over 2r{li ~ 1) treatments, or 2r^ (A; - 1) observations. 

[/«■(?• — 1)(S + C) — r(r + 1)P -{■ k{k — 2) (r — l)!r| 

_ i + (r + !)<? — krSsc ~ krTx + kll^ 1 

(k — 1)(?' — l)(/cr — k — r — 1) 

Example. In the corn experiment in table 12.3, suppose that treat- 
ment (18) in replication I is missing. Here fc = 5, r = (fc -|- l)/2 = 3. 
Omitting the observation 33.3 for this treatment, we find 

P = 126.8; 0=119.9; 5-1-0 = 245.7; 1^ = 721.9 
r = 56.5; 0 = 2156.1 

Sx is the total of the rows and cokunns in which treatment (18) appears 
in squares II and III. Thus 

Sx = 165.1 ■+- 145.3 4- 109.8 + 126.0 = 546.2 

Finally^ Tx is the total for treatments (2), (9), (1 1), and (25), which ap- 
pear in the row that has the missing value, plus that for treatments (5), 
(6), (12), and (24) which appear in the column with the missing value. 
These totals are to be taken from the table of treatment totals. This 
gives Tx = 709.2. Then 

((20)(245.7) - (8)(721.9) + (30)(66.5) | 

1 + (6)(2156.1) - (10)(546.2) - (10)(709.2)i 

^ - — — SS^O-'- 

■■■■ 32 

The exact formulae for /-tests are complicated^ and it appears that any 
good approximate rule would also be rather complicated. The following 
rule is mggestedv though it somewhat underestimates the standard 
errors for the smaller squares and also lacks the simplicity that might be 
desired. For a comparison between the means of two treatments 24. and 
we assign to each an effective amount of replication which depends on 
the number and situation of the missing values. ^Vllen scoring ^4 we 
examine each replicate in turn and assign a score by the following rules. 



Case 

L A missing 

IL A present, and B in the same row or column as A 

i. B missing 

ii. B present 

HI. A. present, and B not in the same row or column as 

i. Other values missing in both row and column 

ii. Other values missing in row or column hut not 

both 

iii. No values missing in row or column 


Score to A 
0 

0 

1 

’ « 

% 
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The effective replication for 4 is of course its total score over all rep- 
lications in the experiment. B is scored similarly. Suppose that in the 
example the mean of treatment (18) were being compared with that of 
treatment (9), which occurs in the same row in square I. The effective 
replication for treatment (18) is 2, since case I arises in square I. The 
effective replication for treatment (9) is also 2, since case II (i) arises in 
square L 
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Plan 12.1 3 X 3 balanced lattice square 
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Flan 12.2 4X4 balanced lattice square 
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* Number of times that two treatihents appear in the same row or column^ 
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Plan W.S 5 X 6 balanced lattice square 

i — 25, A; = 5, r “ 3, rows «= 15, columns « 15, X == 1 
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Plan 12, 7 X 7 balanced lattice square 

t = 40, k — 7, r “ 4, rows = 28, columns = 28, X ~ 1 
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Flan U.5 8X8 balanced lattice square 


i « 64, /c = 8, r « 9, rows === 72, columns 72, X == 2 


Square I 

Square II 

Square III 

1 9 17 25 33 41 49 57 

1 10 19 28 37 46 56 64 

1 44 62 56 27 39 18 13 

2 10 IS 26 34 42 50 5S 

3 11 19 27 35 43 51 69 

4 12 20 28 36 44 52 60 

9 2 51 44 61 30 23 40 

17 60 3 36 29 62 15 48 
26 42 35 4 21 14 63 66 

46 2 17 35 16 53 60 31 

64 23 3 25 64 12 45 34 

55 40 29 4 58 41 11 22 

5 13 21 29 37 45 53 61 

33 68 27 20 5 54 47 16 

28 9 60 63 6 24 38 43 

6 14 22 30 38 46 54 62 

41 26 59 12 63 6 39 24 

37 51 15 42 20 6 32 57 

7 16 23 31 39 47 55 63 

8 16 24 32 40 48 66 64 

49 18 11 60 45 38 ?" 32 

67 34 43 52 13 22 31 8 

19 61 48 14 33 26 7 52 

10 30 36 21 47 59 49 8 

Square IV 

Square V 

Square VI 

1 60 54 40 43 15 26 21 

1 11 20 30 34 48 53 63 

1 59 62 38 42 16 29 23 

62 2 25 11 24 37 62 47 
56 31 3 41 38 20 61 10 

16 2 66 45 59 28 22 33 

21 52 3 39 32 58 9 46 

63 2 32 13 19 36 54 41 

63 28 3 47 40 18 57 14 

39 16 45 4 50 57 19 30 

26 47 38 4 17 13 64 51 

34 15 46 4 49 61 24 27 

44 17 34 55 5 32 14 59 

40 62 31 19 5 4'9 42 12 

48 22 39 51 5 25 10 60 

13 35 23 58 28 6 48 49 

27 53 64 22 9 42 7 36 

43 26 61 16 56 6 36 18 

54 24 10 57 44 35 7 29 

11 33 21 64 31 6 44 50 
30 56 58 17" 12 43 7 37 

18 46 12 29 63 51 33 8 

60 37 41 50 14 23 27 8 

20 45 9 26 62 55 35 8 

Square VII 

Square VIII 

Square IX 

1 32 47 61 22 50 12 35 

1 14 24 31 36 45 51 58 

1 2 3 4 5 6 7 8 

20 2 14 49 39 64 43 20 

12 2 55 48 67 27 21 38 

14 12 16 10 15 9 13 11 
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22 49 3 37 26 63 16 44 
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32 43 33 4 23 10 62 53 

31 27 26 32 SO 29 25 28 

23 36 57 46 5 11 56 26 

35 64 30 18 5 52 41 15 

36 38 37 33 35 34 40 39 

63 63 40 27 10 6 17 46 

47 29 60 9 66 6 34 19 

45 48 44 43 41 47 46 42 

16 41 28: ,34 51 21 7 62 

38 19 53 15 25 44 58 8 

60 20 13 59 46 40 7 25 

61 39 42 54 11 17 28 8 

51 55 49 53 62 66 60 54 
58 67 63 62 64 60 59 61 
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Plmi 12.6 X 9 balanced lattice square 


t = 81, .= 9, r 6, rows 
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11 X H balanced lattice square 
t == 121, = 11, r == 6, rows = 60, columns =« 66, X - 1 
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PLANS 

{Coniinued) 11 X 11 balanced lattice squai-e 
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86 

63 

40 

17 
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92 
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LATTICE SQUARES 
13 X 13 balanced lattice square 


Plan m.s 

t = 169, A; = 13, r 7, rows == 91, columns - 91, X - 1 
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45 

57 


70 
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34 

46 
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83 
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li 

23 
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Pl(m 12.8 {Continued) 13 X 13 balanced lattice 

[.. ■ Square III 

44“"~54'~~77 87~ of 107 130 140 150 “loO 

161 ”2 25 35 45 55 78 88 98 108 118 141 ~i51 

i 152 162 3 26 36 46 56 86 89 99 lOO 119 142 
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i 103 113~ 123 133 156 166 7 17 27 50 S) 70 80 

"Sl 104 114 124 134 144 167 8 Ts 28 51 61 71 

72 82 92 115 125 135 145 168 9 19 29 52 ' "m 

"“eS 76 93 116 126 136 146 Iffii 10 20 30 « 

"ii 64 74 84 94 117 127 137 . 147 157 11 21 31 
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I ; Square IV 
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|:V: - 136 144 165 4 25 38 ■ 41 62 70 , 91 99 107 128 

ty 129 137 145 166 5 26 34 42 63 71 79 ^^^1^^ ^ 

I--''' 109 130 1 38 146 167 6 64 35 43 64 72 80 101 

I; 102 no 118 1 39 147 168 7 15 86 44 65 73 81 

82 103 111 119 140 148 169; 8 16 , 37^ 53 74 

|v V ';' r; ’: 75 83 104 112 120 141 149 157 9 17 , 38 46 ; 54 

I Vy ; 55 76 84 92 113 121 142 150 158 ; 10 18 39 4^ 

i ' ■ V : 48 56 77 85 93 114 122 143 151 ,159 11 ,19 

|; V 28 49 , 57 78 86 94 115 123 131 152 16 0 12 20 

1 ”21 a) 50 58 66 87 96 116 124 132 153 161 13 
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Plan 12.8 (Continued) 13 X 13 baliuicod lattice squiire 
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Plan {Coniinucd) 13 X 13 balanced lattice square 
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CHAPTEE 13 

INCOMPLETE LATIN SQUARES (YOUDEN SQUARES) 

13*1 Description 

13*11 Youden Squares, These designs, which are constructed by a re- 
arrangement of certain of the balanced incomplete blocks, possess the 
characteristic '"double control” of the latin square, without the restric- 
tion that the number of replicates must equal the number of treatments. 
A latin square with 13 treatments is rarely used, because it necessitates 
13 replications. There are, however, incomplete latin squares for 13 
treatments in either 4 or 9 replicates. Every treatment occurs once in a 
column (replication), and every pair of treatments appears together an 
equal number of times in the same block. Column differences are 
eliminated automatically from the treatment comparisons, while block 
differences may be removed by adjusting the treatment mean yields in 
the same way as with balanced incomplete blocks. Most of the designs 
were developed by Youden (13.1, 13.2), whose name is commonly as- 
sociated with them; previously Yates (13.3) had drawn attention to the 
group of designs in which the number of replicates is one less than the 
number of treatments. Table 13.1 summarizes the designs for numbers 
of treatments up to 37. 


TABLE 13.1 >SiiMMARy op incomplete latin squares (r less than t) 


t 

T — k 

X 

B 

Type t 

i 

r - k 

X 

E 

Type t 

4* 

3 


89 

II 

13 

4 

1 

81 

I 

0 * 

4 

3 

94 

11 

13 

9 

6 

95 

I 

6* 

5 

4 

96 

11 

15 

7 

3 

92 

I 

7 

3 

1 

78 

II 

15 

8 

4 

94 

I 

7 

4 

2 

88 

11 

16 

6 

2 

89 

T 


6 

5 

97 

11 

16 

U) 

6 

96 

I 

8* 

7 

6 

98 

II 

19 

9 

4 

94 

I 

C) * 

8 

7 

98 

II 

19 

10 

5 

95 

I 

10* 

9 

8 

99 

11 

21 

5 

1 

84 

I 

11 

5 

2 

88 

I 

31 

6 

1 

86 

I 

11 

d 

3 

92 

I 

37 

9 

2 

91 

I 

11 * 

10 

9 

99 

II 







* Constructed from at Xt latin square by omission of the last column. " 
t This refers to the method of analysis (see section 13.2), 
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:: 13,2 ' ■ ; ' BTATISTIGAL ANALYSIS 

Tlie number of re|>lieate.s is always equal to the number of units per 
block. The syrnl>ol X shows the number of times that two treatments 
appear together in ti blo(*k, wiiile E is the effieienc}?^ factor expressed in ^ 

percentag(\ :| 

Plans an' given (plans 13..1-13.15) for all designs in table 13.1 except 
those marked with tm asterisk (*), which are constructed from latin I 

squares l>y the iunission of 11k‘ last column. A 5 X 5 square, for in- I 

stance, furnishes the plan for i — 5, r = 4. | 

An interesting lipirlieatiou f)f the design for the control of plant vari- | 

ability was made by Youden (13.1) in greenhouse experiments on to- 
bacco-mosaic virus. The ex})erimental unit was a single leaf, and the 
data consisted of the number of lesions produced per leaf by rubbing the 
leaf with a solution which contained the virus. The numbers of lesions 
had been foimd to depend much more on inherent qualities of the plant 
than on tlie position of the plant on the greenhouse bench. Conse- 
qiientl}^ each block of the design was a single plant, so that the large dif- 
ferences in responsiveness which e.xisted among plants did not contribute 
to the axperimeiital errors. The columns were the positions, from top 
to bottom, of the five leaves which were used on each plant. That is, 
the first replication contained the top leaf of e^^ery plant. Since there 
was a fairly con.sisi:ent gradiemt in responsiveness do™ each plant, this 
control also pnjved effective. 

As in the example iibove, when laying out an incomplete latin square 
the geiuu'al prhudph^ is to group the units so that differences among 
Iffocks and difference's anumg columns represent the major sources of 
variation that are known or suspected, 

13,12 Randomization. The steps arc 

1. Rearrange the blocks of the plan at random. 

2. Rearrange the replications of the plan at random. 

When a plan is repeated for additional replication, blocks and columns 
are randomized separately within each repetition. The repetitions may 
be kept separate, 

13.2 Statistical Analysis 

In experiments with more than 10 treatments, the numbers of degrees 
of freedom for blocks are large enough to allow the use of inter-block in- 
formation, The appropriate analysis is described in the next section. 

For small experiments, where inter-block information should be ignored, 
the analysis is given in section 13.22. The recommended method of 
analysis for any experiment is indicated both in table 13.1 and in . the 
plans by the *Type^^ symbol (I or 11)1 , . 
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INCOMPLETE LATIN SQUARES (YOUDEN SQUARES) 13.21 


13,21 Type I. Analysis with Recovery of Inter-’block Information. 
The steps are as follows (i ~ number of treatments; k — number of units 
per block — r). 

1. Calculate the treatment totals T, the column totals, the block 
totals, and the grand total 6. 

2. For each treatment, calculate the total Bt of all the blocks which 
contain the treatment. Place these quantities in a column next to the 
column of treatment totals. Then form a third column of the quantities : 

W = (t - h)T - {t - l)Bt+ (k - l)G 
The TPs should sum to zero. 

3. The separation of degrees of freedom in the analysis of variance is 
as follows. 



d.f. 

m.s. 

Columns (replications) 

(fc - 1) 


Blocks (eliminating treatments) 

it - 1) 

Ei 

Treatments 

it - 1) 


Error 

(fc - 2)(f - 1) 

Ee 

Total 

{tk - 1) 



The total s.s. and the sums of squares for columns and treatments are 
found by the usual methods. The sum of squares for blocks (eliminating 
treatments) is the sum of squares of the TF\s, divided by kt(t — k)(k — 1). 
The error s.s. is obtained by subtraction. 

4, Calculate the factor 

(B, - Ee) 

^ tik - l)Eb 


The adjusted total for any treatment is 

F - T + mTF 

Should Eb be less than Ee, jx is taken as zero and no adjustments are 
made to the treatment totals. The adjusted treatment means are ol>~ 
tained on dividing each Y by r. The estimated error variance of the dif- 
ference between two adjusted treatment means is 


2Ee 




13.22 Type IL Analysis without Recovery of Inter-block Information. 
The analysis proceeds as follows. 
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13.23 TYPE la. BEPETITIONS OF TYPE I 

1. Calculate the column totals, the block totals, the treatment totals 
2V and the grand total G, 

2. For each treatment, obtain the quantity 

Q -Bt 

where Bt is the total of all the blocks in which the treatment appears. 
The quantities Q should sum to zero. 

3. The degrees of freedom in the analysis of variance are partitioned 
as in section 13.21. In this case the blocks s.s. is calculated without elim- 
inating treatment effects, while the treatments s.s. is adjusted for block 
differences. The total s.s. and the sums of squares for blocks are com- 
puted by the standard procedure. The treatments s.s. (adjusted) is the 
sum of the squares of the Q's, divided by tk\. The error s.s. is found by 
subtraction. 

4. In order to obtain any treatment mean, adjusted for block dif- 
ferences, divide the corresponding Q by i\ and add to the quotient the 
mean for the whole experiment. 

The estimated error variance of the difference between two adjusted 
treatment means is 

2E,{t ^ 1) 

(r - l)t 

Numerical examples of this analysis for i = 7, r ^ 3, and = 21, 

— 5 are given in reference (13.1), and an example for t = 6, r - 5 in 
(13.3). 

13.23 Type la. Repetitions of T 3 rpe I. Let n be the number of repli- 
cations in the basic plan, which is used p times in an experiment, so that 
the total number of replications r — np. 

The method of analysis presented here may be used with any of tho 
designs in table 13.1, since even with small numbers of treatments the 
repetition provides sufficient degrees of freedom for estimating the inter- 
block variation. If, however, the efficiency factor exceeds 95%, it is 
scarcely worth while to utilize inter-block information. 

The changes necessary in the procedure of section 13.21 are outlined 
below, the numbers referring to the steps in that section. 

1. Unchanged. Corresponding to any block, there are (p — 1) other 
blocks which contain the same set of treatments. The block totals 
should be arranged in a table (table A, say) with t rows and p columns, 

2. Unchanged. Note that the calculation of the quantities Bt is ex- 
pedited by the use of the row totals of table ^1. - ' , 
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3, The analysis of variance is as follows. 

d.f. m.s. 

Columns (replications) {kp — 1) 

Blocks 

Component (a) (p — l)(j5 — 1) 

Component (6) — D 

Block total pik — 1) & 

Treatments (t — l) 

Error (t — l)i'pk - p — 1) Ee 

Total {pkt- 1) 

In this analysis, the (p —T) degrees of freedom among repetitions 
have been ascribed to the columns, so that the blocks s.s. is actually a 
sum of squares among blocks within repeUtions. 

Component (a) of the sum of squares for blocks consists of comparisons 
among blocks which contain the same set of treatments and is the inter- 
action s.s. for table A. 

Component (b) is the sum of squares of the IF^s, with a divisor 

4. Unchanged except that 

■ p(Ei, - Ee) 

^ " 'pt(k ~ l)Eb -it- k)ip -^l)Ec 

The estimated error variance of the difference between two adjusted 
treatment means is 

■ ■ , 2E 

— [1 + («-«)m] 

' ■■ 

13.24 Type Ila. Repetitions of Type IL As we have pointed out, 
in designs where the efficiency factor is high, inter-block information 
can be neglected. The changes required from section 13.22 should 
present no difficulty. The degrees of freedom subdivide as in section 
13.23. The p{t — 1) degrees of freedom among blocks within repetitions 
are calculated in the usual way, each repetition furnishing {i — 1) de- 
grees of freedom. Notice that the. divisors for the treatments s.s. and 
for the quantities Q must be increased by the factor p. 

13.26 Missing Data. The formula for inserting an estimate in place 
of a missing observation is obtained by minimizing the intra-block error. 
This is the correct estimate for the smaller experiments where inter- 
block information is ignored. For reasons given in section 10.13, this 
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DESIGNS FOR SMALL NUMBERS OF TREATMENTS 
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estimate will also be used to provide a, n approximate solution when 
inter-block information is recovered* The formula is quite similar to that 
for balanced incomplete blocks, with the addition of an extra term in- 
volving the column (or replication) total. 

Let C, iJ, and T be the totals of the column, block, and treatment that 
contain the missing value, and let G be the grand total. Further, as in 
the statistical analysis, let be the total of all blocks in which the 
treatment with the missing value appears (there will, of course, be r such 
blocks). Finally, 

r" = total (over all replicates) of all other treatments that appear in 
the block which has the missing value 

Bt — total of the values for all ofAer treatments that appear in the 
block with the missing value 

The estimate x of the missing value is 

\[rC + tB + it - 1)T - G\ - tT - (r - l)Bt + B/ 
r(7' — l)(r — 2) 

The symbol X is given for each design in table 13. L Care must be 
taken not to confuse the block totals with the column totals. 

If the experiment contains p repetitions, the formula is 
^ plpkX C + pf^B + k{ k - l)T - pUi - hr -- jk - l)Bt + B/] 
r{k — l)(pfe — p “• 1) 

where R is the total of the repetition in which the missing value occurs, 
and X is as in table 13.1. 

13.3 Other Designs for Small Numbers of Treatments 

13.31 Description. When the number of treatments is small, it is 
sometimes useful to have a design of the ‘datin square” type in which 
the number of replicates exceeds the number of treatments. Some de- 
signs can be obtained by repetition of an ordinary latin square or of a 
Yoiiden square. Additional plans can be constructed by adding a Yoii- 
den square to a latin square. A selection of these designs is shown in 
table 13.2 for numbers of treatments up to 7, 

. It will be noted that plans are available for most numbers of replicates 
up to 10. Actually, plans can be made for any number of replicates up 
to 10; those shown here have been selected for ease of analysis. 
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TABLE 13.2 Incomplete latin squares foe small numbers op treatments 

0 ’ > 0 


t 

k 

r 

E 

Ref. to 
plan 

Typet 

t 

h 

r 

E 

Ref. to 
plan 

Typet 

3 

5 

5 

96 

13.16 

III 

4 

9 

9 

99 

13.22 

IV 

3 

3 

6 

lOG 

2L.S. 


4 

5 

10 

96 

13.20 

Ila 

3 

"T 

7 

98 

13.17 

IV 

5 , 

6 

6 

97 

13.23 

IV 

3 

8 

8 

98 

13.18 

III 

5 

4 

8 

94 

* 

la 

3 

3 

0 

100 

3L.S. 


5 

9 

9 

99 

13.24 

III 

3 

',,5 

10 

96 

13.16 

Ila 

5 

5 

10 

100 

2L.S. 


3 

10 

10 

99 

13.19 

IV 

6 

7 

7 

98 

13.25 

IV 

4 

5 

5 

96 

13.20 

IV 

6 

5 

10 

96 


la 

4 

3 

6 

89 


la 

7 

4 

8 

88 

13.2 

la 

4 

7 

7 

98 

13.21 

in 

7 

8 

8 

98 

13.26 

IV 

4 

4 

8 

100 

2L.S. 


7 

3 

9 

78 

13.1 

la 

4 

3 

9 

89 

* 

la 








* By repetition of the plan for r — ^ — 1, which is constructed by talcing a. i X t 
latin square and omitting the last column. 

t This refers to the method of analysis (see section 13.2). 


TABLE 13.3 Incomplete latin square .for 4 treatments in 7 replicates 
Treatment symbols and yields of tomatoes (pounds) 

Column 


Block 


II 

III 

IV V 

VI 

VII 

Total 

■ . '■ i"' 

' 2 

■ 2 

4 

;:.^4 ' ' ■ 3'' 

3 

1 



50 

72 

83 

82 76 

89 

74 ■ 

526 

2 

1 

3 

3 

1 4 

2 

■ 4' 



40 

59 

71 

91 59 

73 

52 

445 

3 

4 

1 

2 

3 1 

4 

2 



43 

67 

58 

98 54 

71 

51 

432 

4 

3 

4 

1 

2 2 

1 

3 



48 

54 

74 

97 75 

76 

54 

477 

Totals 

. 181 

242 

286 

368 264 

308 

231 

1880 







Adjusted means 




T 

Q - kr 4- B' 

Q 48 (subtract 11.2) 


1 


465 

3,781 

78.8 


67.6 


2 


476 

3,777 

78,7 


67.5 


3 


495 

3,897 

81.2 


70.0 


4 


444 

3,585 

74.7 


63.5 

. Check totals or mean 

1,880 

15,040 



67.1 
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13.32 TYPE HI. STATISTICAL ANALYSIS WHEN i'l! 1 


13.32 T 3 rpe III. Statistical Analysis When it - 1. In this ease 
the size of block, which equals the number of replicates, is 1 less than 
some multiple {i) of the number of treatments. Since these designs have 
not been discussed in the literature, an example is given from uniformity 
data. Table 13.3 shows the tomato yields in pounds of 28 single-row 
plots, liartman and Stair's data (13.4), on which the design for 4 treat- 
ments in 7 replicates has been superimposed. 

Since the plots measured 6 feet X 24 feet, each column (replication) 
is compact, being 24 feet square. There are obviously substantial dif- 
ferences among replicates. The blocks, although very oblong, may also 
exhibit differences in yield, because each block is a separate row of plants. 

The efficiency factors of these designs are all very high, so that inter- 
block information is ignored in the analysis. 

The steps in the analysis are as follows: 

1. Calculate the column totals, the block totals, the treatment totals 
T, and the grand total G. 

2. A property of these designs is that each treatment is replicated less 
in one block than in the other blocks. Thus, in the example, treatment 
(1) appears only once in the first block, but twice in all other blocks. 
Similarly, treatment (2) is deficient in black 2 and so on. For each 
treatment calculate the quantity 

Q ^ kT + B' 


where is the total for the block in which the treatment is deficient. 
For instance 

= 7 X 465 + 526 - 3781 

As a check, the quantities Q should sum to [k + 1)(?. 

3. All sums of squares in the analysis of variance are obtained in the 
usual way except that for treatments, which is given by the simr of 
squares of deviations of the Q's, divided by k{k^ — 1). In this case we 

have,:.'.-:' 


(3781)^+ (3777)^ + (3897)^ + (3585)^ - ^(1 5,040)^ 


150 


The analysis of variance is shown below. The degrees of freedom sub- 
divide as in section 13.21. 



d.f. 

s.s. 

m.s. 

Columns (replications) 

6 

5387 

897.8 

Blocks 

3 

750 

250.0 

Treatments 

3 

150 

60.0 

Error 

15 

766 

61.1 

Total . . ■ 

27 

7053 
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The elimination of block differ has substantially reduced the 
error m.s. 

4. To obtain the adjusted treatment mean yields, we first divide each 
Q by (/c^ - 1), in this case 48. From the resulting quantities we sub- 
tract — 1), or 1880/168 ~ 11.2. As a further check, the mean 

of the adjusted treatment means should equal the mean yield of the 
whole experiment (67.1). The efficiency factor is (k^ — l)/k^, and the 
estimated error variance of the difference between two adjusted treat- 
ment means is 

2E^ 

13.33 Type IV. Statistical Analysis When ft - it + 1. The analysis 
closely resembles that of section 13.32. For each treatment there is one 
block in which the treatment has extra replication. Thus, in plan 13.17 
for ^ = 3, r == 7, treatment (1) appears 3 times in the first block but only 
twice in any other block. The only changes in the computing instruc- 
tions of section 13.32 are (i) Q = kT — 7?', where B' is the total of the 
block in which the treatment has extra replication, (ii) the sum to 
(k — 1)G, and (iii) for the adjusted treatment means, divide Q by 
(k^ — 1)^ and add to the quotient the quantity G/tk{k — 1). The chauge 
in the divisor of G should be noted. 

The error variance of the difference between two adjusted means re- 
mains as in section 13.24. 
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Plan 184 t = n, k = 6, f = 6, 6 = 11, X = 3, E = 92, Type I 


Reps. 
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PlanlS.ti j = 31, fc = 6,t = 6, 31, X = 1, 5 = 86, Type! 
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Plan 13.16 t = 3, i = 6, r = 5, fi = 3, X = 5, ® = 96, Type III 
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CHAPTER 14 


ANALYSIS OF THE RESULTS OF A SERIES OF 
EXPERIMENTS 

14.1 Initial Steps in the Analysis 

14.11 Introduction. In a program of research it is quite common to 
repeat the same experiment at a number of different places, on a number 
of different occasions. There may be several reasons for this. Some- 
times the object of the research is to produce recommendations which 
are to apply to a population that is extensive either in space or in time 
or in both. Thus in agricultural field experimentation, many projects 
are undertaken in the hope that their results can be applied in practical 
farming. The conclusion»s drawn from such research, if they are to be of 
use, must be valid for at least several seasons in the future and over a 
reasonably large area of farm land. It has been found that the effective- 
ness of the common plant nutrients, of different varieties of a crop, and 
of different cultivation practices usually varies from field to field and, 
even more markedly, from season to season. A single experiment, how- 
ever well conducted, supplies information about only one place and one 
season. Consequently such experiments are carried out at several dif- 
ferent places in the area for which recommendations are wanted, and are 
repeated for a number of seasons. 

In other cases we may be interested, not in making inferences about 
some specific population, but in studying the influence of external con- 
ditions on some measurement or on the responses to treatments. For 
example, is the vitamin A content of a vegetable affected by the climate 
in which it is grown? Do the relative durabilities of two types of road 
surface depend on weather or topography or traffic load? Repetition of 
such experiments in different places is necessary in order to have vari- 
ations in the external factors that are under investigation. 

A third example is supplied by collaborative experiments on biological 
assay by a group of laboratories. Here the objects may be to obtain in- 
formation about the accuracy with which the potency of a drug can be 
estimated and to discover whether different laboratories reach the same 
conclusions about the relative potencies of different drugs. Bliss (14.1) 
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gives an interesting account of a serie>s of experiments of this type, where 
sixteen laboratories estimated the potency of preparations of digitalis by 
injection into cats. 

The appropriate statistical analysis for the data from a series of ex- 
periments will of course vary with the object of the research. Never- 
theless, the preliminary stages of the analysis tend to be the same in all 
cases. In this chapter an introductory account is given of these initial 
procedures, which present difficulties that are not always appreciated. 
The first step, which is to analyze and interpret the data from each in- 
dividual experiment, will be assumed to have been completed. 

When we begin to combine the data, the first point of interest is to 
examine whether the differences among treatments are the same in all 
experiments. This question is likely to be relevant whatever the purpose 
of the experiments. In experiments designed to lead to the recommenda- 
tion of a ^ffiest” treatment for some operation, we wish to know whether 
there is a consistent superiority of certain treatments, or whether on the 
other hand we may have to consider recommending different treatments 
for different circumstances. In the other two types of experimentation 
mentioned above, the test would indicate whether the responses to treat- 
ments have varied with the external conditions of the experiment, or 
whether the different laboratories agreed in their estimates of relative 
potency. 

Secondly, it is often, though not always, desirable to estimate and 
compare the average effects of treatments over the whole series of ex- 
periments. 

14.12 Numerical Example. The data come from a group of 6 experi- 
ments on Irish potatoes conducted in two counties of North Carolina in 
1945 and 1946. The results have been described by Nelson and Hawkins 
(14.2), and form part of a study of the responses to applications of super- 
phosphate on soils of varying degree of fertility. In 1945, on each of the 
6 sites, the amount of readily soluble phosphorus in the soil itself was 
estimated from soil samples by the modified Truog method. These 
amounts varied from 48 to 850 lb. P2O0 per acre. The treatments com- 
prised 5 different levels of application, 0, 40, 80, 120, and 160 lb. P2O5 
per acre. 

These experiments are an example of the second type discussed above, 
in that their object was to find out the extent to w’-hich the responses to 
treatments were influenced by the condition of the soil. The soils were 
not intended to be a representative sample of the soils in the counties, 
but were chosen so as to give a wide range in fertility. 
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Potatoes were grown on the plots in 1945 and 1946, the treatments 
being applied in both years, with in addition substantial applications of 
nitrogen and potash on all plots so that these nutrients would not be 
deficient. The data to be analyzed are the 1946 results. The mean yields 
and error m.s. per plot are shown in table 14.1. In five of the six experi- 

TABLE 14.1 Tebatment yields (100 lb. per acee) of Ieish potatoes, 1946 


Experiment number 


Pounds P 2 O 5 

1 

1 2 

i 3 1 

! i ' 

1 5 1 

1 6 

applied 
per acre 

Amount of readily soluble phosphorus 
(pounds P 2 O 5 ) in soil, 1945 

48 1 310 1 410 1 710 1 790 | 850 

0 

114 

142 

180 

170 

130 

170 

40 

206 

176 

188 

171 

140 

178 

80 

231 

201 

207 

188 

150 

187 

120 

237 

205 

208 

185 

152 

188 

160 

252 

217 

217 

189 

156 

189 


106 

83 

158 

43 

204 

52 

d.f. 

21 

12 

12 

12 

12 

12 

. . f t, ; 

4;. 

5 

5 

/. ■-:5 

..■■'6 ■■ 

5:V-. 


* Error in.s. per plot. 

t Number of replications. 

ments the F-ra,tios for treatments against error were significant. One 
feature of the results in table 14.1 is that experiment 1, which has the 
lowest amount of available phosphorus in the soil according to the soil 
tests, actually gave the highest total yield in 1946. It should be re- 
membered that the soil tests were made in 1945, and that some residual 
effects of the 1945 applications may have persisted. Whatever the rea- 
son, this soil was apparently very responsive to phosphorus in 1946; the 
plots without phosphorus do have the lowest yield of all places. 

It is possible to compute from table 14.1 a combined analysis of vari- 
ance for all six experiments. As will be seen later, there are limitations 
to the use of such an analysis. For the present we will ignore any com- 
plexities that may arise. Further, in order to discuss only the simplest 
case at first, we will omit experiment 1, which differs in size and structure 
from the other experiments. This omission is of course inappropriate 
from the agronomic point of view, but the data are being used to throw 
light .on the general procedure in analysis rather than on the agronomic 
questions involved. 
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14.13 Preliminary Combined Analysis When all Experiments Have the 
Same Design. Formally, the analysis subdivides into the following 
components. 

Places 
Treatments 
Treatments X places 
Pooled experimental eiTor 

In addition, in a complete analysis there are terms representing the dif- 
ferences among rows and columns in the individual experiments. Since 
these are not relevant to the interpretation, the}^ are not included. 

The analysis can be computed from either treatment means or totals; 
the totals will be used here and are shown in table 14.2. 


TABLE 14.2 Treatment totals for experiments 2 to 6 (in 100 lb.) 


Pounds 

P 2 O 6 

2 

E 

3 

xperimei 

4 

it 

■6 1 

6 

Total 

0 

710 

900 

850 

650 

860 

3,960 

40 

880 

940 

855 

700 

890 

4,265 

80 ' 

1,005 

1,035 

940 

750 

935 

4,665 

120 

1,025 

1,040 

925 

760 

940 

4,690 

160 

1,085 

1,085 

045 

780 

945 

4,840 

Total 

4,705 

5,000 

4,515 

3,640 

4,560 

22,420 


The first 3 components are computed in the same way as for a single 
randomized l;>locks experiment, except that all sums of squares are di- 
vided by an extra 5 in order to reduce them to a single-plot basis. Thus 
we'have ; ■ 




25 


125 


^ ^ ^ (3960)^4- ••*+(4840)2 (22,420)2 __ 

Treatments: : 7 ;: — ~ — - 20,979 


26 


126 


To obtain the treatments X places s.s. we calculate the total s.s. for 
table 14.2, which comes to 69,199. Then 


Treatments X places: 69,199 - 41,367 - 20,979 = 6853 


Since all experiments have the same number of error degrees of free- 
dom, the pooled error m.s. may be found as the simple average of the 
five values in table 14.1. This comes to 108, as given in table 14.3. 

The null hypothesis that the treatment differences are the same at all 
places (i.e., that there are no treatment X place interactions) is tested 
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by the F-ratio 428/108, or 3.96, with 16 and 60 d.f., respectively. >Since 
the 5% level is 1.81, the ratio is definitely significant, 

TABLE 14.3 Combined analysis op vaetance (on a single-plot basis) 


Source of variation 

d.f. 

s.s. 

m.s. 

Places 

4 

41,367 


Treatments 

4 

20,979 

5,245 

Treatments X places 

16 

6,853 

428 

Pooled error 

60 

6,480 

108 


A test of the average responses to treatments taken over all five places 
is of minor interest in this example, because the places do not constitute 
a random sample from a population about which we wish to make in- 
ferences. For purposes of illustration, however, we will assume that the 
places were selected as a random sample of fields on which potatoes might 
be grown commercially. 

In the F~test of the average effects of treatments there are two possible 
candidates for the denominator of F— the mean square for the treat- 
ments X places interactions (428) or the pooled error m.s. (108). Some- 
times this competition does not arise, because our knowledge of the data 
and the F-test of the interactions both indicate that there is no reason to 
suppose real interactions to be present. In that event the mean squares 
for interactions and error may be pooled to form a single denominator 
for the F-test of treatments. But frequently the experimental conditions 
are such that we expect interactions to be present. This is so in the ex- 
ample, where previous work in a number of countries has shown a rela- 
tion between the response to phosphorus and the amount found in the 
soil by a soil test. Consequently, even if the interactions m.s. had not 
proved significant, we might have been unwilling to assume that there 
were no interactions. 

In discussing the two F-ratios it is helpful to examine the mathematical 
model on which the combined analysis is based. If Xij is the observed 
mean for the jth treatment at the ^th place, we postulate that 

Xij — + 'iri + Tj + + % (14.1) 

where Tj represent the effects of the place and the treatment, respec- 
tively, }Xij that of the treatment X place interaction, and that of the 
experimental error. % is the average of the errors on the r plots that re- 
ceive the treatment at that place. 

With this model we can study the nature of the mean squares in the 
analysis of variance. If the experimental errors on individual plots have 
a variance and if the interaction terms may be considered to have 
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a variance the average values of the mean squares work out as fol- 
lows, 

TABLE 14.4 Expected values of mean squaees 
Source of variation Expected value of mean square 

Treatments v/ 4- ^ 

Treatments X places -h ror^^ 

Pooled experimental error 

The symbols r, p, and t stand respectively for the numbers of replica- 
tions, places, and treatments. 

It will be noted that the treatments m.s. is influenced by three com- 
ponents — ^the experimental error variance, the variance of treatment X 
place interactions, and the variance among the true treatment means 
Tj. The pooled experimental error takes account of only the first of these 
components. Hence the treatments m.s. may be statistically significant, 
as compared with the pooled error m.s., either because there are xeal dif- 
ferences among the r^s or because treatment X place interactions are 
present. This J^-test is informative only when we are indifferent as to 
whether a significant treatments m.s. was due to real treatment dif- 
ferences or to interactions. This situation is rare in practice. In most 
cases, on the contrary, having established that interactions are present, 
or at least that it is not safe to assume them absent, we wish to know 
whether in addition there are consistent differences among the effects of 
treatments. As table 14.4 shows, the appropriate denominator of F for 
this test is the interactions m.s. This contains both the interaction and 
error components of variation in exactly the same way as they enter into 
the treatments m.s. The F-ratio, 5245/428, or 12.26, with 4 and 16 d.f., 
is significant at the 1% level 

The conclusions from this initial analysis are: (i) there are real dif- 
ferences in response that are consistent from place to place, and (ii) there 
are real variations in responsiveness from place to place. It need not be 
stressed that these statements do not constitute a competent summary of 
the results. They merely indicate what should be examined next. In 
the present case we would consider which treatments have proved con- 
sistently superior, and, if possible, why this is so. Similarly, we should 
investigate the nature of the interactions and try to find a rational ex- 
planation for them. 

14.2 Criticisms of the Preliminary Analysis 

14.21 Heterogeneity of the Interaction Variance. The previous anal- 
ysis is open to several criticisms. Although these criticisms are not valid 
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for all series of experiments, our experience is that it is well to be on one's 
guard against them. They, deal essentially with the assumptions on 
which the combined analysis was based. 

These assumptions have been presented in section 14.13. Specifically, 
we postulate that the model (14.1) holds. Further, the experimental 
errors (Hj of individual observations are assumed to be normally and in- 
dependently distributed with the same variance o*/. Finally, for the 
F-test of the treatments m.s. against the treatments X places inter- 
actions, we require also the assumptions that the interaction terms 
are normally and independently distributed, with zero population means 
and the same variance and are independent of the c's. 

The first criticism is that some components of the treatments X places 
s.s. may be much larger than others, or in mathematical terms that the 
^'interaction” variance is not constant. This will happen if the ef- 
fectiveness of some treatments varies greatly from place to place, while 
that of others varies less or not at all. 

If the interaction m.s. is heterogeneous in this sense, the F-test of 
treatments against interactions is vitiated. The general effect is that the 
F value read from the tables is too low, i.e., that too many significant re- 
sults are obtained. Some idea of the extent of the bias can be obtained in 
certain extreme cases. In the numerical example, if one comparison 
among the treatment means has a much larger interaction variance than 
any other comparisons, the F-ratio is distributed approximately as an 
F value with 1 and 4 d.f., respectively, instead of 4 and 16 d.f. The 5% 
significance level would be 7.71 instead of 3.01. More general^, if there 
are p places, F is distributed approximately with 1 and (p — 1) d.f. 
This situation produces about the greatest distortion in F that is likely 
to arise, so that 7.71 could be regarded as an upper limit to the signifi- 
cance level of F. 

The exact distribution of F can be worked out, but is not available in 
a form adapted for practical use. Even so, in many instances, uncer- 
taint 3 ^ about the correct significance level for F does not preclude us from 
drawing conclusions from the test. For, in the example above, if F 
turned out to be 1.38 we would be confident that it is not significant, since 
the 5% level is at least 3.01. Similar^, there seems little doubt that the 
F value actually obtained in the experiments, 12.26, is significant, since 
it lies well above 7.71. On the other hand, we would be uncertain about 
significance if F happened to lie between 3 and 7. 

A better method of coping with this difficulty is to divide the treat- 
ment s.s, into a set of orthogonal components that will supply all or most, 
of the information of interest to us. The interactions s.s. is partitioned 
in the same so as to isolate the interactions of each component with, 
places. By Bartlett's test for homogeneity of variance, reference (14.3), 
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we can then test whether is the same for all the components of the 
interactions s.s. If we decide that cr/ can be assumed constant, the 
difficulty vanishes. If is not constant, it is valid to test any com- 
ponent of the treatments s.s. against fe own interaction with places. 
Apart from the extra computations, the only drawback to this pro- 
cedure is that the degrees of freedom in the denominator of F are reduced. 

These remarks will be illustrated by the example. In these experi- 
ments the principal point of interest is to see whether the average re- 
sponse to a given amount of phosphorus decreases when the amount 
presumed to be in the soil increases. It may also be worth while to ex- 
amine whether , the rate of decline in response with the higher levels of 
dressing changes with the nature of the soil. These questions can be 
considered conveniently by fitting at each place a parabolic regression 
of the yield on the amount of dressing. Although a parabolic regression 
would scarcely be regarded as the true form of the response curve, it ap- 
pears to fit the data well. 

Since the dressings increase by equal amounts, the orthogonal poly- 
nomials given by Fisher and Yates (14.4) are suitable. The calculations 
are shown below for experiment 3. The linear term is proportional to 


Multipliers for 


Amount 

Total 

Linear 

Quadratic 

of P 2 O 5 

yield 

term 

term 

0 

900 

—2 ■ 

2 

40 

940 

-1 


SO 

1035 ‘ 

0 

-2 

120 

1040 

1 

-1 

160 

1085 

2 

2 

Sum of products 

470 

-80 

Divisor for square 

50 

70 


the average increase in yield per 40 lb. of P 2 O 5 : the quadratic term may 
be interpreted as measuring the rate of decline in response with increased 
dressings. The values found for the two terms are as follows. 

■ ■.■■Experiment./. 



■ 2 ,': ■■'"■ 

3 

4 

5 

6 

Total 

L 

. 896.: 

470 

260 

320 

.240.- 

2185 

Q 

^325 i 

-80 

-70 

-100 

-no i 

-685 


The contributions to the treatments s.s. are 

(2185)^ (685)^ 

Linear: = 19,097: quadratic: = 1341 

250 350 


399 


34.22 HETiaiOGENKITY OF EUROR VARIANCES 


In the treatmentH X places s.s. we have 

( 895 ) 2 +...+ ( 240)2 


L X places: 


50 


Q X places: 


(325)2 + ...+ 
70 


- 19,097 = 5893 

- 1341 = 645 


In table 14.5 the three interaction mean squares are in the same order of 
size as the corresponding treatment mean squares. This result is 
typical, in that large effects tend to have large interactions. It is obvious 
on inspection that the components of the interaction m.s. cannot be 
regarded as homogeneous. Accordingly, we test the linear component 
of treatments against the mean square 1473 instead of against the com- 
plete treatments X places m.s, of 428 in table 14.3. This makes a sub- 
stantial difference to the F-ratio, though it remains significant. The 
quadratic X places m.s. (161) is not significantly above the error m.s. 
(108), but it seems prudent to use it as the denominator in the F-test of 
the quadratic term in treatments. The remainder of the interactions 
shows no sign of significance, and could be pooled with error for an 
test of the deviations from regression. 


Tx\BLE 14.5 Subdivision of the analysis of variance 


Source of variation 

d.f. 

s.s. 

m.s. 

Treatments 

Linear 

1 

19,097 

19,097 

Quadratic 

1 

1,341 

1,341 

Deviations from I’cgression 

2 ' 

541 

270 

Treatments X places 

Linear X places 

4 

5,893 

1,473 

Quadratic X places 

4 

645 

161 

Deviations X places 

8 

315 

39 

Pooled error 

60 

6,480 

108 


14.22 Heterogeneity of the Experimental Error Variances. A second 
criticism concerns the assumf>tion that the experimental error variances 
d/ are the same in all experiments. In general this assumption will hold 
only if all experiments have been conducted in the same way, with tlm 
same amount of control over environmental conditions and with experi- 
mental material of the same variability. In experiments with crops or 
animals this degree of uniformity is seldom attainable, because the 
natural variability among pieces of land or among animals at one place 
differs from that at other places. Hence in this and many other t3?'pe3 of 
cooperative experimentation we expect a prim that experimental error 
variances change from place to place. 
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: When there is doubt about this point, Bartlett^s test of homogeneity 
of variances can be applied to the error m.s., in the experiments. In 
the example (table 14.1) the values varied from 43 in experiment 4 to 
204 in experiment 5, and the test of homogeneity shows that o-^.^ cannot 
be assumed constant. 

Variation in a-e^ invalidates the i^»test of the interactions m.s. against 
the pooled error m.s. Although the effect on the significance level of F 
is not known exactly, it operates so that use of the tabular F produces 
too many significant results. As in the previous section, the extent of 
the distortion in F can be seen in extreme cases. If one experiment has 
a much higher error variance than aziy of the others, will be distributed 
approximately as the tabular F with — 1) and W degrees of freedom, 
where tis the number of treatments and is the number of error degrees 
of freedom in the experiment with the high error variance. In the ex- 
ample, if one of the experiments with 12 error d.f. happened to be much 
less accurate than the other experiments, would have 4 and 12 d.f. in- 
stead of 16 and 60 as used in the test in section 14.13. The 5% level 
would be 3.26 instead of 1.81. Since this case appears to be the most un- 
favorable that would occur, we may conclude that the true significance 
level of F lies somewhere between these values. 

If the observed F in our data falls outside these limits, a definite con- 
clusion can be reached from the test without further knowledge of the 
exact significance level of F. This happens in the example, where the 
observed E-ratio for the interactions m.s. is 3.96. However, even if the 
exact significance level were known, the test can be criticized because 
the E-ratio is no longer the most sensitive test criterion. Studies have 
sho^vn that, with the amount of variation in cr/ that appears typical of 
agricultural experimentation, this loss of sensitivity might be equivalent 
to discarding 10 to 20% of the data. An approximate test that avoids 
some of the loss in sensitivity will be presented in section 14.4. 

Thus far we have been discussing how the E-test of the interactions is 
affected by heterogeneity in the error variances. The E-test of treat- 
ments may now be considered. ^ If the interactions are regarded as neg- 
ligible, so that the denominator of E is the pooled m.s. for interactions 
and error, the E-test of treatments is invalidated in about the same way 
as that of interactions. On the other hand, when interactions are 
present, and especially when they are large, the E-test of the treatments 
m.s, against the interactions m.s. is much less disturbed. The reason 
may be clearer if we consider the test of the linear component of treat- 
ments in table 14,5. The suggested denominator for this test was the 
linear X places component, 1473- This is nearly 14 times the pooled 
error m.s. It appears, therefore, that heterogeneity in the error variances 
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can influence only a small part of the interaction m.s., so that its effect 
is, as it were, greatly reduced. These remarks apply with much less force 
to the quadratic X places m.s., to which, taking the data at their face 
value, the pooled error contributes more than half. 

14.23 Summary. When all experiments are identical in structure, a 
combined analysis of variance can be computed relatively easily. Ap- 
plication of the ordinary tests of significance to this analysis is frequently 
open to question because of heterogeneity in the error and interaction 
variances. Despite this, it is advisable to dra^v preliminary conclusions 
as far as possible from this analysis, at least in the present state of our 
knowledge, because tests that are fully efficient and theoretically sound 
have not yet been discovered, and the approximate tests that have been 
devised to meet the criticisms are more laborious. 

When the interactions are sizable, there is usually little difficulty in 
interpreting the combined analysis. For, if the F-ratio for interactions 
is large, we may be confident that it is statistically significant in spite of 
some uncertainty about the exact significance level of F, and the crit- 
icism that the F~test is not fully sensitive carries less weight if F estab- 
lishes significance. Further, as we have seen, the F-test of the treat- 
ments m.s, against the interactions m.s. is little affected by inequality 
in the error variances when the interactions are large. The chief point 
to remember is that the interactions m.s. may itself be heterogeneous. 
Subdivision of the treatments and interactions m.s. according to the 
treatment comparisons that are of greatest importance is often 
useful. 

Another situation that presents little difficulty occurs when the aver- 
age differences among treatments are substantial, yet interactions are 
negligible. In this case it may be found that neither the F value for in- 
teractions nor that for treatments is close to the significance level. The 
cases that leave us in doubt are those where the F values are just slightly 
above the tabular significance levels. 

14.3 Experiments of Unequal Size 

14.31 Numerical Example. Provided that the same set of treatments 
appears in all experiments, a combined analysis can usually be con- 
structed even when the experiments differ in size and structure. Some 
issues arise, however, that are not encountered when all experiments are 
identical in design. The problems will be illustrated by the inclusion of 
the first experiment in the previous example. This was arranged in ran- 
domized blocks with 4 replications, whereas all other expeidments were in 
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5X5 latin squares'^. The treatment totals and numbers of replications 
are shown in table 14.6. 

TABLE 14.6 Tbeatment totals (in 100 lb.) 


Pounds 

P 2 O 5 

1 

2 

Experiment 

[■■a I.-' 4 1 

5 

6 

Total 

No. of 
plots 

0 

456 

710 

900 

850 

650 

850 

4,416 

29 

40 

824 

880 

940 

1 855 

700 

890 

5,089 

29 

80 

924 

1,005 

1,035 

940 

750 

935 

5,589 

29 

120 

948 

1,025 

1,040 

925 

i 760 

940 

5,638 

29 

160 

1,008 

1,085 

1,085 

945 

780 

945 

5,848 

i 

29 

Total 

1 4,160 

4,705 

i 5,000 

4,515 

3,640 

4,560 

i 26,580 

145 

No. of plots 

20 

1 ■ 25 

25 

25 

25 

25 




A combined analysis can be obtained by following the standard pro- 
cedure for data based on imequal numbers. The square of any quantity 
is divided by the number of replications involved. The sums of squares 
are given below. 

Total- ^ + (710)^ + ■ - + (945)^ (26,580)^ _ 

4 5 145 


(4160)2 (4705)2 + . . . + (4560)2 (26,580)2 

Places: t:;— ^ __ 

20 25 145 


== 55,509 


Treatments: 


(4416)2 +...+ (5848)2 (26,580)2 


29 145 

Treatments X places: 131,925 - 55,509 - 45,613 = 30,803 

The pooled error m.s. requires a little thought. If all experiments 
have the same true error variance orJ^j the best procedure is to weight 
each error m.s., by the number of degrees of freedom But if the 
experiments vary in accuracy, this weighted mean is a biased estimate 
of the component of error variance that enters into the interaction and 
treatments m.s. The correct component is 2 2 instead of 

S %’cr// Consequently, unless we are confident that is con- 

stant, it is best to weight the values by the numbers of replications. 
This gives 

, 4(106) + 5(83 + 158 + • • • + 52) 

7;: ;.:;;:. ;:X08:' .. 

29 

* Since experiment 1 contained additional treatments (not discussed hero), it pro- 
vided 21 error d,f. 


14.31 EXPERIMENTS OF UNEQUAL SIZE 4O3 

The F-test of tlie interactions m.s. against the error m.s. is carried out 
in the same way as with experiments of equal size, and is subject to the 
criticisms previously discus, sed. The F-test of the treatments m.s. 
against the interactions m.s. encounters a new difficulty that is due to 

TABLE 14.7 Preuimixahi an.alysis of variance for experiments 


Oi^ 

unequal size 


Source of variation 

d.f. 

g.S. 

m.s. 

Places 

6 

55,509 

11,102 

Treatments 

4 

45,613 

11,403 

Treatments X places 

20 

30,803 

1,540 

Pooled error m.s. 

81 


108 


the unequal numbers of replications and is present even if all the as- 
sumptions required for the analysis of variance are satisfied. Under 
these assumptions, the expectations of the principal mean squares in 
table 14.7 are as given in table 14.8. 


TABLE 14,8 Expbcteb values of mean squares (with unequal numbers of 

replications) 


where 


Source of variation 
Tn^atments 


Expected value of mean square 
o-u + + (22 n) — — 


Treatments X places a-J^ + f 2 or^^ 

Pooled error 

141 _ 

h = T-1-,T (E ri - ?i) = ~ (29 - 4.862) = 4.S28 
(p — 1) 5 


The complicating factor is that the coefBcient of in the expected 
treatments m.s. is not the same as that in the expected treatments X 
places m.s. ; in fact^ it is always larger. The difference is very small in 
this example, as it is whenever the experiments do not vary much in 
numbers of replications. In addition, and for the same reason, the in- 
teractions m,s. is not distributed as a multiple of chi-square; that is, it 
does not have the type of distribution required for the validity of the 
F-test. Both factors tend to make the F-test give too many significant 
results. 

A crude method which at least prevents us from being led too far 
astray is to adjust the F~ratio so as to remove the upward bias in ex- 
pectation. Let the expected values of the three mean squares be de- 
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noted by Bt, and respectively. Then, if the null hypothesis is true 
(all Tj equal) , it is >seen from table 14.8 that the three expected values are 
connected by the equation. 

Bt (k •— l)0e = kdfp 

where A) = h/h^ In other words, the test that we seek is a test that this 
relation holds. This suggests that F might be computed as 

+ (/v - 11 403 + 0.007(108) 

■ ■ pf == 1 2...+-^-— -l-l. = ^ = 7,35 

k8tp^ (1,007) (1540) 

as compared with the original F-ratio of 11,403/1540, or 7.40. The sig- 
nificance level of F' will also be altered slightly from that in the table. 

When interactions are likely to be large, and the chief interest centers 
in the test of the treatments m.s. against the interactions m.s., an al- 
ternative approach has much to commend it. This is to compute the 
analysis from the unweighted treatment means given in table 14.1, paying 
no attention to the numbers of replications. With this type of analysis 
the bias disappears, the expectations of the mean squares being shown 
in table 14.9. It will be noted that cre^ and carry the same coefficients 


TABLE 14.9 ExpnoTED values of mean squakes in an unweighted 

■ ANALYSIS ^ 

Source of variation Expected value of mean square 

_2 , „E(^y-T)® 


Treatments 


rh ^ (t - 1) 


Treatments X places ~ + o* J 
■ fh 

in both expectations. The divisor fu is the harmonic mean of the num- 
bers of replications, given by Vh = v! Moreover, if is much 
larger than fSQlfk^ the interactions m.s. tends to be distributed as a 
multiple of chi-square, so that the conditions for the F-test are closely 
approximated. 

It is suggested that the pooled error m.s. for insertion in an analysis 
of this type be calculated from the formula 

v\n ^2 Tp/ 

By an extension of the i/esults in table 14.9 to the case where the experi- 
ments have different e^or variances, this quantity is found to be an un- 
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biased estimate of the error component that enters into the mean squares 
for treatments and treatments X places. For the data in table 14.1 to 
have 

1 /lOf) 83 + 158 +*-*+ 52\ 

~ "7 ( — H ) = 22.4 

6 \ 4 5 / 


TABLE 14.10 UNwinouTED analysis of vabi^vnce of treatment means 


Source of variation 

d.f. 

s.s. . 

in.s. 

Places 

5 

11,692 

2,338 

Treatments 

4 

10,554 

2,638 

Treatments X places 

20 

7,158 

358 

Pooled error 

81 


22 


Table 14.10, being in terms of treatment means, is in different units 
from table 14.7 (p. 403). As a conversion factor, the harmonic mean % 
which in this case is 4.8, ma3r be used to multiply the mean squares above 
for comparison with those in table 14.7, though this conversion is not 
needed for our purpose. 

In a choice between a weighted and an unweighted analysis, the fol- 
lowing are the relevant considerations. The statements below are 
strictly true only if all experiments have the same error variance per 
observation (or plot), though they should remain substantially true with 
a moderate variation in The weighted analysis is superior for the 
F-test of the interactions. It gives a more powerful test, and the F-ratio 
approximates the tabular distribution of F more closely. It is also 
superior for the F-test of treatments if intei’actions are non-existent or 
small, since in this case the bias in the test is negligible. The unweighted 
analysis ivS preferable for the F-test of treatments when the interactions 
are not negligible. As it happens, the results in the example are not too 
\vell in accord with these statements. For the interactions test, the F- 
ratio is 14.26 with the weighted analysis and 15.98 with the unweighted 
analysis. The explanation of the higher value with the unweighted 
analysis is probably that experiment 1, whose responses differed from 
those in the other experiments, had the smalle>st number of replications. 
The values of F in the test of treatments were practically identical, being 
7.35 (after adjustment) in the weighted analysis and 7.37. in the un- 
weighted analysis. 


14.32 The Combined Analysis for a Series of Lattice Experiments. In 
the discussion of lattice experiments it was pointed out that if inter-block 
information is recovered, the F-test of treatments in an individual ex- 
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periment is not exact, because the relative weights attached to inter- and 
intra-block estimates are subject to sampling errors. For the same rea- 
son any combined analysis must also be approximative rather than exact. 

The designs need not be identical at all places. With 25 treatments, 
for example, some experiments might be simple lattices in 4 replications, 
while others are lattice squares in 3 replications. Also, inter-block in- 
formation need not be recovered in all analyses ; at some places we might 
have used a randomized blocks analysis. 

If the experiments have different numbers of replications, the pre- 
vious section has indicated that there is a question whether to use a 
weighted or an unweighted analysis. The former is preferable when the 
chief purpose is to test the interactions with places, and when vsuch in- 
teractions are negligible; the latter when interactions are sizable and the 
principal interest is in a test of the average effects of treatments. With 
lattice experiments as used in agriculture, we are often interested in both 
interactions and average effects. But as interactions are seldom absent, 
the unweighted analysis is frequently advisable, and has the virtue of 
being slightly simpler. 

The weighted analysis will be described first. First form a two-way 
^Treatments X places^^ table similar to table 14.6. The entries in the 
table are the totals, except in experiments analyzed 

by randomized blocks, where unadjusted totals ax’e used. This table is 
analyzed into components for 

Places 
Treatments 
Treatments X places 

If the numbers of replications n differ, the weighted analysis follows the 
same procedure as in table 14.G (p. 402). The square of every marginal 
treatment total is divided by in order to reduce the analysis of vari- 
ance to a single-observation basis- The square of a place total, summed 
over all treatments, is divided by tvij where t is the number of treatments. 

The only additional component is the pooled error m.s. In any ex- 
periment where inter-block information has been recovered, we use as 
the estimate of the error variance per observation (or plot) the effective 
error in.s. E/, This quantity is obtained by adjusting the intra-bhxik 
m.s. upwards so as to allow for sampling errors in the block corrections, 
and its calculation is explained along with the analysis for each type of 

■ p.. . ■ . ■ ■ ■ .^T ■■ 

lattice. For the triple lattice, for example, i?/ is E, ' 


1 + 


/w *4“ 1 


If 


there are any experiments in which the intra-block analysis has been 
carried out, E/ is given the value which it takes when Ei becomes very 
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large relative to Ep For experiments analyzed by randomized blocks, 
F?/ is the ordinaiy error m.s. 

On the assumption that the true effective variances are unlikely to be 
the same in all experiments, the pooled eiTor m.s. is calculated as 
When I'i is constant, this reduces to the unweighted 
mean of the i?/. Tests of significance are made as in the numerical ex- 
ample, and are subject to the same general criticisms. Since the larger 
lattice designs provide ample degrees of freedom for erroiy the pooled 
error may be relatively well determined even if some experiments are 
much more accurate than others, so that there is less uncertainty about 
the true significance levels of the i^~ratios. 

For an unweighted analysis with experiments of unequal size, con- 
struct a two-way table of adjusted treatment means. This is analyzed 
by the simple standard procedure. The comparable pooled error m.s, is 
taken as 


With balanced designs, the method given above is a natural extension 
of the technique used to obtain an approximate F-test of treatments in 
an individual experiment. With partially balanced designs the method 
is more crude than that used in single experiments, where a special sup- 
plementary calculation was made for the F-test. This calculation can be 
extended to apply to a combined analysis, but it is doubtful whether the 
elaboration is worth while. 


5/ 


^ 1/^ 
pXn 


E2' E, 

+ ■ — + • • • + 
V\Ti r2 r, 


r /\ 
I 

1) / 



14,4 A Test of the Treatments X Places Interactions I 

In this section we give an approximate test of the treatments X places f 

interactions for series of experiments where there is considerable varia- 
tion in the experimental error variances. It was suggested that the F- 
test, despite its imperfections, will often serve our purpose. The present 
test differs from the F-test in that it gives less weight to experiments 
which have a high error variance, and consequently may be expected to 
be more sensitive. It may be useful in cases where there is doubt about 
the verdict given by the F-test and in cases where the most efficient 
analysis is desired. 

As before, let Xij be the mean of the jth treatment, and be the true 
error variance per observation in the ith experiment. We assume that 
all experiments have the same design, and let s/ be the error m.s. per ob- 
servation in the ith experiment, based on n degrees of freedom. 

B^or known values of cr/ the theory of least squares indicates that the 
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mean rr*/ should receive a weight Wi = In a weighted analysis of 

variance of the treatment means, the interaction s.s. is known to be 
distributed as chi-square with (p — l){t - 1) degrees of freedom. This 
sum of squares appears to be the best test criterion available, unless we 
possess specific information about the type of interaction that may exist, 
in which event a more specialized criterion would be constructed. 

In default of knowledge of the cr/, the natural step is to consider a 
weighted analysis with weights Wi ^ r/s/. That is, we weight each 
mean inversely as its estimated variance, since we do not know its true 
variance. This is satisfactory, provided that the are good estimates 
of the cr/. Upon examination it appears that the s/ should be based on 
at least 15 di. To illustrate the calculations, the test will be applied to 
the data from the last 5 experiments, though the number of degrees of 
freedom, 12, is slightly too low to use the test with full confidence. 

TABLE 14.11 Treatment mean yieeds (100 lb. per acre) fob a weighted 

ANALYSIS OF VARIANCE 


Pounds 


Experiment 


Weighted total 


P 20 S 


3 

4 

■. 5 . ; ' 

' -'B '! 

Y^u'iXti=Tj 

0 

142 

180 

170 

130 

170 

53.570 

40 

176 

188 

171 

140 

178 

57.000 

80 

201 

207 

188 

150 

187 

62.194 

120 

205 

208 

185 

162 

188 

62.264 

160 

217 

217 

189 

156 

189 

63.932 

m 

0.060 

0,032 

0.116 

0.025 

0.096 

0.329 - If 

Total (Pi) 

941 

1000 

903 

728 

912 


W-iPi 

56.460 

32.000 

104.748 

18.200 

87.552 

298.960 = G 

S.S. (8i) 

t 180,655 

200,946 

163,431 

106,440 

166,618 



The arrangement of the data in table 14.11 should be noted. The treat- 
ment means and s/ values come from table 14.1 (p. 393). The weights 
for individual entries in the table ai*e placed in the row immediately be- 
low the entries; for example, wi = 6/83 — 0.060. The total of the 
weights is denoted by W. 

1. Form the column totals Pi and the products Form the 

weighted row totals. The corner total, Q == 298.960, supplies a check 
on both the column and row totals. 

2. The items in the analysis of variance are obtained as follows. 

Correction term: 

^ (298.960)^ 

tw (5)(0.329) 
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Total: First compute and record in the table the sum of squares 8i 
(uncorrected) of the entries in each column. The total s.s. is then found 

as 

2 {WiSr) - C - ( 0 . 060 ) ( 180 , 665 ) + • • • + ( 0 . 096 ) ( 166 , 618 ) - (7 = 

551.33 

Places: A column (place) total has variance and hence receives 

a weight Wi/L The sum of squares is 

- 0 = 230.09 

This is found readily by using the products WiPi. 

Treatments: Each treatment total has estimated weight If. The sum 
of squares is 

2^ Ty2 (53.570)2 + • • • + (63.932)2 

- C = ^ ^ - C = 229.57 

W 0.329 



C = 229.57 


Treatments X places: Since this is the only component in which we are 
eurrentl.y interested, it is imfort, unate that it must be found by means of 
the others. As usual, it is given by 

551.33 - 230.09 - 229.57 = 91.67 = I 

The sum of squares I does not follow a chi-square distribution, being 
inflated by errors in the w^eights. It can be reduced to a quantity that is 
distributed appro.ximately as chi-square.* We take 

^ (n - 4)(n - 2) . (p - l)(f - l)(ra - 4) 

_ -.- i, 

7i(n -f f — B) (n + t — 3) 

In this ease 


- 9.14 dX 


(8) (10) , (16 (8 
^ (91.67) = 43.65, with—*— 
(12) (14) (14) 


In this approximation the degrees of freedom ascribed to chi-square are 
not integral. Significance levels are obtained by linear interpolation in 
the tables of chi-square. Since the 1% levels of chi-square are 21,67 and 
23.21 for 9 and 10 d.f,, respectively, I is obviously significant. 

The test can also be applied to any component of the interactions s.s. 
As an example, we will test the interaction of the quadratic component 
of the regression on amount of phosphorus. This was previously tested 
by* the jP-test in table 14.5, and found non-significant. The values given 

* This test is an extension and modification of a test previously given by Cochran 
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14.4 


for the quadratic components in section 14.21 were deiived from treat- 
ment totals; to obtain corresponding values from means, we divide by 5. 

Experiment 



2 


4 

5 

6 

Total 

Qi 

WiQi 

-65 

- 3.900 

-16 

- 0.512 

-14 

- 1.624 

-20 

- 0.500 

-22 

- 2.112 1 

-8.648 


Each quantity has variance 14 cr//r, and receives a weight The 

sum of squares is 

1 1 


(340,89 - 227.32) - 8.U - I, 


The quantity Iq is the sum of squares for the interaction of a single 
treatment comparison with all p places, and would normally have 
(p — 1) degrees of freedom. To present the general formula for con- 
version to chi-square, suppose that we have computed the sum of squares 
for the interaction of ii treatment comparisons with pi of the places, so 
that there would normally be ti (pi — 1) degrees of freedom. Then 




2 _ 


(n — 4)(n 
n(n + — 2) 


2) fi(Pi 

Iq, With 


l){n - 4) 


(Ji + ~ 2) 


d,f. 


This formula is in accord with that used for the whole {t ” l)(p — 1) 
degrees of freedom in the interactions. Since the t treatments provide 
{t — 1.) independent treatment comparisons, we would take ti ~ {t — 1) 
in applying the formula to the complete interacitions s.s. For ti — \, 
P = ^ 


( 8 )( 10 ) 

( 12 )( 11 ) 


(8.11) = 4.92, ivith 


0X8) 

( 11 ) 


= 2.91 d.f. 


The tables show that this value has a probability between 0.2 and O.l, 
since by interpolation the 20% level for 2.91 d.f. is 4.51, the 10% level 
6.10. The earlier F-test gave a probability slightly over 0.2, so that the 
two tests are in close agreement. 

If the experiments differ in size, the weight Wi becomes rf/s/. The 
numbers of error degrees of freedom wll also vary. This necessitates a 
more elaborate conversion formula; as a rough approximation the aver- 
age number of error degrees of freedom per experiment is used in place 
oin. ^ 

. The test is not recommended for low values of n, because the s/ are 
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relatively poor estiniates of the and frequently one or two oxpori- 
ineiits receive such high weights that they dominate the analysis. Ap^ 
propriate methods are presented by Yates and Chchran (1.4.5) and 
Cochran (14.(3). 

14.5 Repetitions in Both Space and Time 

As has been mentioned^ agricultural field experiments arc often re- 
peated both at a number of places and for a number of years. We will 
give only an introduction to the simplest case, in which experiments have 
been carried out at each of p places for y ^^ears. At any place a new site 
is chosen each year for the experiment, and a new randomization em- 
ployed, so that the data from successive years may be regarded as 
independent. The experimental arrangement need not be uniform 
throughout the whole series of experiments: in fact, a change in design 
between seasons is not uncommon. 

The mathematical representation of the mean of the jth treatment at 
the tth place in the icth year is now more lengthy. In addition to the 
symbols previously used for place and treatment effects, yjc will denote 
the effect of the year. Interactions are denoted by multiple symbols: 
thus (7rT)^-j is the contribution of the place X treatment interaction in 
this experiment. With this notation 

Xijk = M + Tf*!* + Tj + 7k + ('n’r)ij + (7r7)^fc + (ry)jk + eijk 

Note that there are treatment X place, treatment X year, and treat- 
ment X place X year intei'actions. 

In experimental programs of this type, it is usually hoped that the 
places and years constitute a representative sample of the population of 
places and 3 ^ears to which the results will be applied. There are obvious 
practical difficulties in choosing places and years that can be confidently 
asserted to be such a representative sample, and sometimes little effort 
is made to ensure that this will be so. The hard fact is that an}^ sta- 
tistical inferences drawn from an analysis of the data will apply only to 
the population (if one exists) of which the experiments are a random 
sample. If this population is vague and unreal, the analysis is likely to 
be a waste of time, at least from the strictly practical point of view. 

It seems appropriate to regard all the quantities in the equation as 
random variables except the general mean ix and the true effects ri of the 
treatments, because if we could tabulate all the values of say (7rr)«7 in 
the population, they would follow some frequency distribution from 
which the values in our data are a sample. For the full application of 
the analysis of variance we require the assumptions that the . experi- 
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mental errors and all interactions of treatments are normally and in- 
dependently distributed j with variances that are denoted by use of the 
letters that enter into the interaction. 

An unweighted analysis of variance of the treatment means is ob- 
tained by the standard procedure for factorial experiments. The im- 
portant parts of his analysis are sketched in table 14.12. 


TABLE 14,12 Analysis of vakiance of treatment means with time and 

PLACE VARIATIONS 


Source of 
variation 

d.f. 

Expectation of mean square 

Treatments 

(t-i) 

(Tfi- -f 4" 4- -h P2/ 

Treatments 



X places 


2 t 2 I V) 

O'fi "T ■T' 

Treatments 



X years 


cr j 4" 4“ 

Treatments 



X places 

X years 

(f - l)(p - l)(y - 1) 


Pooled error 


■ 


In the most general case, <r/ is the average value of (ra^/vik^ where 
is the error variance in the individual experiment at the /th place in 
the Ath year, and is the number of replications in that experiment. 
Consequently, the pooled error should be estimated as the average of 

The method to be followed in testing the significance of the successive 
terms is made clear by the expected values of the mean square. The 
treatments X places X years interactions are tested against the pooled 
error. Both treatments X years and treatments X places are tested 
against treatments X places X years. 

The situation with regard to the test for treatments is interesting. It 
is evident that no other mean square in the analysis is suitable as a de- 
nominator of F, unless either the treatments X places or the treat- 
ments X years interactions appear to be negligible, so that the corre- 
sponding variance is assumed to be zero. If this happens, the other 
interaction m.s, is an appropriate denominator for F (subject to some 
uncei'tainty in case the assumption should not be correct). 

When both two-factor interactions are present, the hypothesis that 
all Tj are zero is equivalent to the hypothesis that 

== ^tp + Oty 
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where the (?’s stand for the expected mean squares in table 14.12. No 
exact tost for this kind of relationship is at present known. By analogy 
with the i^'-test, one suggestion is to use the ratio 

“b •^ipy 

where the values are the mean squares in the analysis of variance, 
'fhis ratio does iiot follow the /'-distribution, but following an appi-oxi- 
ination suggested by Satterthwaite (14.7) and others, we might use the 
ir-tables with %' and /%' degrees of freedom, where 

m' - — — — - — —•■■■ 

St ^ Slpy 

^tpy 

where the n^s are the numbers of degrees of freedom in the corresponding 
mean squares. The analogous expression is used for 712 . 

After completion of the analysis, the next step is to examine the nature 
of the interactions and average effects of the treatments, bringing to bear 
any external knowledge that will throw light on the interpretation. 
Often, for the practical uses of the results, we would like to decide 
whether a single recommendation can be made for application in the 
whole population, or whether, on the other hand, it is necessary to have 
several recommendations for different parts of the population. These 
decisions are facilitated by calculating confidence limits for the dif- 
ferences among the means of the most likely candidates. For a discus- 
sion of methods for making this calculation, which can give rise to some 
complications, see reference (14.5). 
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CHAPTER 15 

RANDOM PERMUTATIONS OF 9 AND 16 NUMBERS 

16.1 Use of the Random Permutations 

In practically all the experimental designs, the randomization con- 
sists in arranging a set of objects, whether treatments, blocks, rows, or 
columns, in random order. If the number of objects does not exceed 9, 
a randomization is obtained at once from table 15.6, which contains 
1000 random arrangements of the numbers from 1 to 9. For example, 
to arrange 7 treatments in random order, select a starting place in table 
16.6 (without inspection of the numbers in the table). Suppose that the 
permutation chosen is 

1 8 9 2 6 7 5 3 4 
Omitting the digits 8 and 9, we obtain 

,. /■ T-:'2 ' .6:;: 7 5 . 3 'T' : ■ ■ 

for the desired arrangement. Similarly, table 15.7, which contains 1000 
permutations of the numbers between 1 and 16, may be used for any 
number of objects between 2 and 16. 

Occasionally the randomization involves dividing the objects into 
groups, as in the completely randomized and cross-over designs. Thus, 
with a cross-over design (section 4.4) having 2 treatments and 10 blocks, 
treatment A must appear in the first row in five of the blocks, chosen 
at random, and in the second row in the remaining five blocks. Select a 
random permutation of the numbers up to 16 from table 15.7, say 

6 16 10 5 12 7 9 4 1 2 15 14 13 11 8 3 

The numbers from 11 to 16 are ignored. The first five of the remaining 
numbers, 6, 10, 5, 7, and 9, are chosen as the blocks in which -4 appears 
in the first row. 

If the number of objects to be randomized exceeds 16, see section 15.3. 

16.2 Construction of the Random Permutations 

The idea of using random permutations instead of the ordinary ran- 
dom digits was suggested by Professor George W. Snedecor. Before the 
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writing of this book, 200 random permutations of each of the numbers 
from 3 to 13 had been made at the Statistical Laboratory at Iowa Stale 
College. These were obtained by first constructing a large number of 
ordinary random digits, which were then transformed into random per- 
mutations. The method of transformation was the usual one: it may be 
illustrated for the numbers from 1 to 11. To obtain a random per- 
mutation of the numbers 1 to 11, a series of pairs of random digits is 
taken, say 

05, 04, 29, 82, 37, 52, 53, etc. 

When we divide each number by 11 the remainders are 

10, 4, 7, 5, 4 (omitted), 8, 9, etc. 

This process is continued until 10 of the 11 numbers have appeared, 
repetitions being omitted. 

Rather than present 200 permutations of each of a series of numbers, 
it was decided to concentrate on two numbers. This decision was made 
par% because any permutation of 9 numbers, say, can be used to give 
permutatiGns of all numbers less than 9. Thus 1000 permutations of 9 
numbers are more useful than 200 of each of the numbers 9, 8, 7, 6, and 
5. Also, certain numbers, such as 11 and 13, are less often used in ex- 
periments than others. The numbers 9 and 16 were selected for presen- 
tation because of their occurrence in factorial experiments. 

The 1000 permutations of 9 were obtained from the permutations 
already available for 9, 10, 11, 12, and 13, by omitting the numbers 
above 9. Of the permutations of 16, 800 were obtained from random 
digits. With as many as 16 numbers to permute, the method of division 
as illustrated above is rather slow. A more rapid alternative, suggested 
by Mr. Paul Peach, was used. In this method 16 pairs of random digit s 
are taken, for example, 

75 10 17 28 91 85 74 56 71 06 26 01 06 21 07 60 

14 5 6 9 16 15 13 10 12 3 8 1 2 7 4 11 

. The permutation is produced by ranking the pairs in order of size, as 
shown below each pair. In the event of a tie, as happens in this example 
with the two 06’s, the two ranks in question may be allotted from an ad- 
ditional random digit. Ties occur frequently (the probability that all 
16 pairs are distinct is about .35) but cause little delay. Although the 
method has the additional merit of using fewer random digits than the 
method of division, with 32 random digits per permutation a large supply 
of random digits is required. The remaining 200 permutations were ob- 
tained by drawing numbered marbles from an urn. 
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16.3 Randomization of More than 16 Numbers 

The most suitable method depends on the facilities available and on 
the frequency with which randomizations have to be made. Navy beans, 
drawn from a box, have been found quite expeditious and convenient, 
though the numbers on the beans rub off with repeated usage. Ah 
ternatively one of the standard sets of random digits, references (15.1- 
16.4), may be used. For example, with 25 numbers to be placed in ran- 
dom order, choose a starting place in the table and select the next 25 
sets of three-digit random numbers. These numbers are then placed in 
increasing order by the method used in the previous section for the per- 
mutations of 16. Three-digit random numbers are taken instead of two- 
digit numbers in order to avoid ties. With three-digit numbers the fre- 
quency of ties is negligible even if as many as 100 numbers are to be 
randomized. If punched card machines are available, the numbers 1 to 
25 may be punched in columns 1 and 2 and the random digits in col- 
umns 3, 4, and 5 of the card. The permutation may then be produced 
and printed by sorting on columns 3 to 5. By filling all the columns from 
3 onwards with random digits, a number of permutations can be obtained 
by sorting on different sets of columns, though the number is limited if 
the permutations are to be kept independent of each other. 

16.4 Tests of Randomness 

Many tests of different aspects of the randomness of the permutations 
could be made. So far as their use in experimental design is concerned, 
the test most immediately relevant is a test of the null hypothesis that 


TABLE 15.1 Number of occurrences op 1, 2, • • •, 9 in 1st, 2nd, • • 
9th position in 1000 permutations op 9 


Number 

1 

2 

3 

4 

Position 

5 

6 

7 

8 

■ : 1 

Totals 

1 

130 

111 

111 

108 

94 

121 

114 

106 

105 

1000 

2 

120 

118 

104 

102 

102 

133 

98 

108 

115 

1 1000 

3 

92 

100 

116 

111 

113 

no 

116 

123 

119 

I 1000 

'U: 

121 

98 

97 

108 

117 

112 

124 

102 

121 

1000 



103 

120 

140 

122 

84 

113 

113 

no 

1000 


107 

118 

118 

88 

113 

120 

124 

111 

101 

i 1000 



119 

121 

124 

110 

94 

95 

113 

116 

i 1000 


: 115 

116 

no 

98 

107 

109 

117 

121 

107 

i 1000 



117 

103 

121 

122 

117 

99 

103 

■:'106 

1000 


1000 

1000 

1000 

1000 

1000 

1000 

1000 

1000 

1000 

! 1000, 



TABLE 15.2 Numbisb of occtorences of 1, 2, •••, 16 rat 1st, 2nd, •••, 16th fosition in 1000 pbbmptations op 16 
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the numbers 1; 2, * •*, 9 have an equal probability of appearing in the 
1st, 2nd, * • • , 9th positions. This test is made by counting the number 
of times that each number has occurred in each position. The results 
are shown in two-way presentations in tables 15.1 and 15.2. 

For the permutations of 9, the expectation in each cell is 1000/9, or 
111,1. Each row and each column in table 15.1 adds to 1000, Taking 
the values in a single column, we may test by whether the 9 numbers 
are equally represented in the corresponding position. If the a:^s are the 
observed enti'ies in the column, 


(,r — 7ny^ 


m 


9 

1000 


_/ loooy 


For computational purposes, this simplifies to 

x" = (a;") - 1000 ■ 

Thus, for the first position, 

== T^(130^ + 120^ + > • ■ + 112^) -- 1000 - 10.81 

with 8 d.f. Similarly, from the rows we may test whether any given 
number has appeared equally often, apart from sampling fluctuations, 
in each position. Although the two sets of tests are not independent, 
both are of interest. The values of x? are shown in table 15.3. 


TABLE 15.3 Values of yc from 1000 permutations of 9 


Position 


Number 


test 

X“ 

test 


1 

10.81 

1 

7.46 

2 

5.19 

2 

9,17 

3 

5. OS 

3 

6.70 

4 

17.16 

4 

7.75 

5 

6,06 

5 

18.90 

6 

15 . 52 

6 

8.97 

7 

8.82 

7 

8.25 

8 

3.88 

8 

3.45 

9 

3.44 

9 

5.32 

Totals 

75.96 


75.97 


In the position tests, the individual probability values range from 
.90 (x^ = 3.44) to ,028 (x^ = 17.16). With 8 d.f., the probability that 
the largest of 9 independent values of x^ should exceed 17.16 works out 
at .23. Actually, owing to the fact that the cell numbers add to 1000 
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in any row or column, the 9 values for the position have a small pos- 
itive correlation which reduces the probability value for the largest 
to about .20. The individual probability values for the number tests all 
lie between .90 (x^ = 3.45) and .016 (x^ = 18.90). The probability that 
the largest x^ should exceed 18.90 is about 0.11. Thus neither series of 
tests indicates any marked departure from randomness. 

The total, 75.97, which is (x — m)^fin taken over all cells in table 
15.1, supplies a composite test of the complete table. On accoiint of 
the positive correlation referred to in the previous paragraph, the value 
75.97 does not itself follow a x^ distribution. In fact, it may be shown 
to be distributed approximately as (9/8) x^, where x^ has 64 d.f. Con- 
(8) (75.97) 

sequently we may test — , or 67.52, as a x^ value rvith 64 d.f. 

As would be expected from the results of the previous tests, the prob- 
ability value, .36, provides no evidence for the rejection of the null 
hypothesis. 

The corresponding values for permutations of 16 are given in table 
15.4.^ 

TABLE 15.4 Values of from 1000 permutations of 16 


Position Number 


test 

f> 

test 

o 

x*' 

1 

14.40 

1 

20.25 

2 

17.70 

2 

12.96 

3 

It .87 

3 

16.58 

4 

15.14 

4 

4.80 

5 

17.70 

r> 

14.94 

6 

7.42 


9.57 

. 7 ■ ■ 

12.29 

7 

13.98 

8 

7.42 

8 

13.63 

\) 

24.54 

9 

15.55 

10 

9.15 

10 

12.45 

11 

10.62 

11 

14.05 

12 

10.51 

12 

18.50 

13 

9.92 

13 

14.50 

14 

18.85 

14 

13.06 

15 

6.98 

15 

14.78 

16 

19.36 

16 

10.21 

Totals 

219.87 


219,87 


The only figure in the group which has an unusual probability is the 
low value 4.80 for number 4. This has an individual probability of .9932 
(15 d.f.). When account is taken of the fact that this is the smallest of 
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the 16 values for the number tests, the probability is found to be .10. 
The total, 219.87, is distributed as (16/15) with 225 d.f. and is found 
to be slightly but not abnormally below its expectation. 

A further test of randomness was made by counting the number of 
imei^simis foY each permutation. The inversions are counted as follows. 
For each number, record how many numbers to the left are greater than 
the number. Thus for the permutation 

3 4 8 1 9 7 2 6 5 

we record 

0 0 0 3 0 2 5 3 4 

The total (3 + 2 + 5 + 3 + 4 — 17) gives the number of inversions. 
The extreme values for the number of inversions are zero (for the per- 
mutation 12 3 4 5 6 7 8 9) and 36 (for the permutation 98 7 654321). 

TABLE 15.5 Observed and theoretical frequencies of numbers of 


INVERSIONS FOR 

PERMUTATIONS 

OF 9 

Number of 

Observed 

Theoretical 

Goodness 

inversions 

frequency 

frequency 

of fit, 

0-6 

S 

6.3 

0.46 

7 

4 

6.0 

0.67 

8 

11 

9.9 

0.12 

9 

10 

15.3 

1.84 

10 

13 

22.1 

3.75 

11 

36 

30.4 

1.03 

12 

38 

39.7 

0.07 

13 

50 

49.5 

0.01 

14 

71 

59.1 

2.40 

15 

65 

67.8 

0.12 

16 

SO 

74.6 

0.39 

17 

71 

79.0 

O.Sl 

18 

90 

SO. 6 

1.10 

19 

75 

79.0 

0.20 

20 

80 

74.6 

0.39 

21 

57 

67.8 

1.72 

22 

58 

59.1 

0.02 

23 

45 

49.5 

0.41 

24 

40 

39.7 

0.00 

’^25 

27 

30.4 

0.38 

26 

27 

22.1 

1.09 

27 

18 

15.3 

0.48 

28 

7 

9.9 

0.85 

29 

11 

6.0 

4,17 

30-36 

’ 8 

6.3 

0.46 

Totals 

1000 

1000.0 

22.94 
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In intermediate cases the number of inversions serves as a measure of 
the extent to which the order is inverted from the natural order towards 
the reverse order. The frequency distribution of the number of in- 
versions has been studied by Rosander (15.5) and Kendall (15.6), who 
derived from the number a measure of the rank correlation between two 
series of figures. For random permutations of 1 to 10 numbers, Kendall 
also tabulated the exact distribution of the number of inversions* His 
distribution ivas extended to 16 figures for the test given here. 

For permutations of 9 the observed and theoretical distributions are 
shown in table 15.5. 

The total 22.94, has 24 d.f. and agrees closely with expectation. 
The corresponding value for permutations of 16 is = 47.52 with 54 
d.f. and a probability value of .72. 
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5 5 6 7 1 
4 12 8 2 
9 3 3 2 9 

7 9 7 4 3 
T6 9 6 5 

6 4 4 3 6 

8 7 8 1 7 

3 2 19 4 
2 8 5 5 8 

7 4 6 1 5 

9 3 8 3 2 

1 6 3 4 7 
6 8 2 8 4 

4 14 7 8 

2 9 19 3 

5 6 5 5 1 

8 2 9 2 9 

3 7 7 6 6 

9 7 7 5 6 

3 8 17 2 

4 3 4 2 7 

5 9 2 8 3 
16 5 11 

6 2 8 3 6 

2 4 9 6 4 

8 5 6 9 9 

7 13 4 8 

7 4 9 8 7 

5 6 112 

4 9 8 5 6 

3 3 2 2 8 
2 14 9 4 

9 7 5 4 5 

6 2 6 3 9 

8 6 8 7 1 

1 8 7 6 3 

8 4 6 8 6 

9 9 4 5 8 
6 6 8 1 1 

7 3 7 7 2 

2 8 9 3 4 

3 7 2 6 9 

5 18 4 5 

4 5 5 2 7 
1 2 19 3 


4 3 3 7 3 

7 112 9 

8 8 8 4 5 

5 5 2 9 2 

6 9 4 3 6 

2 4 6 8 1 

1 2 5 6 8 

3 6 7 5 7 

9 7 9 1 4 

9 2 2 2 9 
1119 8 

6 5 8 4 5 

4 8 7 8 6 

2 3 9 3 4 

7 9 6 6 2 

3 7 4 7 7 

8 6 5 5 3 

5 4 3 1 1 

9 9 9 3 8 

6 2 7 1 6 

7 3 17 2 

3 7 5 8 9 

5 6 4 4 1 

8 4 6 2 5 

1 8 3 5 4 

2 5 2 6 7 

4 18 9 3 

0 7 17 1 

6 4 6 1 4 
118 4 8 

5 2 3 2 2 
4 6 2 8 3 

3 9 7 9 9 
8 8 5 5 5 
2 3 9 3 7 

7 5 4 6 6 

2 19 9 7 

4 4 8 7 8 

6 8 3 1 9 

7 3 6 2 2 

1 5 6 5 1 

8 6 4 6 3 

9 9 18 4 

3 2 7 3 6 

5 7 2 4 5 


TABLE 15 

8 7 4 6 3 

9 5 7 8 2 

2 4 6 1 6 

1 6 5 3 5 

4 3 9 2 9 

7 9 3 4 1 

3 12 9 8 

6 8 8 7 7 

5 2 15 4 

2 8 17 3 
9 4 9 5 4 

6 17 19 
5 7 5 4 5 

4 2 2 3 0 
1 6 4 6 1 

8 5 8 9 2 

7 9 6 8 8 

3 3 3 2 7 

9 8 6 1 7 

4 13 4 2 

1 5 4 8 6 

2 9 17 1 

7 3 7 2 3 

5 2 2 6 8 

3 6 5 0 4 

8 7 8 3 9 

6 4 9 5 5 

9 2 3 8 7 

5 9 12 8 

3 5 4 9 3 

7 3 8 6 9 
2 7 6 5 1 

1 4 2 3 4 

8 6 7 7 2 

4 16 16 

6 8 9 4 6 

2 2 18 9 

8 7 5 9 7 

7 5 7 5 5 

3 8 9 4 6 

5 4 3 6 4 

4 18 2 1 
1 9 4 3 2 

9 3 2 1 8 

6 6 6 7 3 


9 7 4 9 4 

8 9 3 6 6 

3 6 7 7 8 

7 8 5 1 9 

5 18 2 3 

6 2 6 4 2 

4 4 18 7 
2 5 0 5 1 

1 3 2 3 5 

2 4 2 1 9 

8 8 8 8 6 

5 2 5 6 3 

9 6 7 5 8 

4 7 4 2 5 

7 9 9 7 4 
15 13 2 

3 16 9 7 

6 3 3 4 1 

5 8 6 1 2 

3 6 2 4 3 

6 2 16 1 
2 3 8 3 4 

4 7 3 8 8 
9 17 5 6 

8 5 0 7 9 

1 9 4 2 5 

7 4 5 9 7 

7 8 5 3 5 

2 4 6 8 7 

3 6 12 3 

4 18 6 1 

5 7 3 1 2 

6 9 7 4 4 

9 3 4 5 8 

8 5 9 7 6 
1 2 2 9 9 

6 19 2 4 

3 6 4 7 7 

6 5 18 5 

4 7 2 6 9 

7 8 7 5 3 

1 9 6 4 8 

8 2 8 9 6 

9 3 5 1 2 

2 4 3 3 1 


lTIOns of 9 

9 2 2 8 8 

1 7 7 2 4 

7 4 4 7 1 

5 19 1 3 

8 3 3 3 2 

2 9 8 5 9 

6 5 16 7 

3 8 5 4 6 

4 6 0 9 5 

2 4 8 3 1 

7 7 5 4 6 

8 5 7 5 5 

5 9 9 7 7 

6 3 3 6 9 
18 4 18 

9 6 2 8 4 
4 16 9 3 

3 2 12 2 

1 9 8 3 3 

2 6 12 8 

7 8 5 1 7 

3 5 9 9 9 
9 3 2 5 6 

4 7 4 6 4 

8 16 8 1 

6 4 7 4 5 

5 2 3 7 2 

5 16 4 9 

7 3 7 6 1 

2 6 8 7 7 

1 9 2 3 6 
9 8 4 1 3 

3 2 5 2 2 

8 7 9 9 4 

4 6 3 5 8 

6 4 18 5 

5 2 6 2 8 

3 8 5 3 6 

2 4 3 8 2 

7 9 7 4 1 

9 5 8 6 5 

4 7 2 1 3 

6 3 4 9 9 
1 6 9 7 7 

8 115 4 


2 7 9 3 5 

4 8 5 7 3 

7 3 2 8 6 

6 5 1 4 9 

8 9 6 1 2 

9 2 4 2 8 

6 4 3 6 1 

3 6 7 9 4 

118 6 7 

2 6 5 4 8 

5 3 2 7 6 

6 9 9 8 1 

8 5 3 3 5 

1 7 8 5 4 

9 2 7 9 3 

.38119 

4 4 6 6 2 

7 14 2 7 

3 17 7 3 

8 8 6 2 7 

5 9 13 6 

7 2 3 4 1 

6 6 9 5 9 

1 7 4 6 4 

4 5 5 9 5 

2 3 2 8 2 

9 4 8 1 8 

7 8 6 1 8 

5 17 4 1 

4 5 3 8 5 

3 9 5 7 7 

6 3 12 9 

8 4 2 6 3 

9 2 4 9 4 

1 6 8 5 2 

2 7 9 3 6 

1 6 8 8 3 

4 4 6 7 7 

5 14 3 6 

3 8 2 6 5 

8 2 7 9 2 

6 3 5 5 1 

2 7 12 4 

9 5 9 1 8 

7 9 3 4 9 


8 3 19 4 

3 7 4 5 6 
6 12 2 2 

2 9 8 7 8 

4 5 7 6 9 

9 6 9 8 1 
14 3 17 

5 2 5 4 5 

7 8 6 3 3 

8 4 9 4 2 

9 3 8 2 1 

3 6 7 9 7 

6 9 4 6 9 

4 5 2 1 4 

1 8 3 5 5 

5 7 13 3 

7 2 6 8 8 

2 15 7 6 

7 6 6 5 5 

8 9 7 4 7 

3 12 3 1 

5 7 17 8 

9 8 9 1 2 
1 2 8 8 6 
2 4 5 9 4 

6 3 3 2 3 

4 5 4 6 9 

2 9 7 3 4 
9 3 4 7 7 
8 5 9 5 1 
12 8 12 

6 15 8 8 

5 6 3 6 3 
4 8 12 9 

3 4 6 4 5 

7 7 2 9 6 

8 19 4 1 

6 6 8 7 8 

4 9 7 8 6 
3 5 3 1 4 

6 3 4 3 5 
2 2 6 9 9 

9 8 2 6 2 

7 7 15 7 
1 4 5 2 3 
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I TABLE 15.6 Pebmutations of 9 (Confanwed) 


I 862245458396522 

I 223398183765138 

I 766054266174496 

I 64453 75224 59984 

f 681829614547869 

i 199113941933375 

I 918481797621743 

i 335662875282617 

? 47777 63398 18261 

‘ 663489538216446 

I 237596175484691 

I 749824721967287 

i 522967259632618 

i 476172986859925 

I 894333612728869 

i 116218464191132 

i: 38164 13 9 3375354 

:i 9 0876 5847<. 43773 

I 742421226286522 

. 869878392311669 

i 964693137574288 

! 177636674897167 

; 228755915955946 

■ 691214851669394 
6 3 6 1 0 7 7 8 9 4 3 8 4 1 3 
416982643743835 

; 383349468122771 

! 975973235811761 

!l 2 5 775 2667227813 

i| 72339 617 4 453276 

[I 44862495 1 9 82 4 55 

63988 15296 64694 

I 661469312748942 

! 394647493599028 

I 882216786376187 

' 116138848136339 

25065 42832 48626 

81157 65718 77278 

49239 36666 34347 

78481 94493 56835 

54672 23946 96482 

32396 1817765169 
169236732923711 

■ 037188928411993 

■ 978447165182564 


4 9 8 2 9 8 6 6 6 6 3 5 6 6 6 1 1 3 9 1 
38916 3743593252 68135 
13375 9488864187 22882 
6859461277 1154944579 
74282 25644 48813 97668 
65141 489112993886267 
8 2 7 3 7 7 9 1 6 9 6 6 4 2 6 6 5 7 2 4 
27463 53792 87791 33413 
91668 12323 72374 79940 

84991 43379 9345932561 
37185 71913 69815 64492 
55662 88437 25731 55325 
62439 65621 34563 93244 
93228 29248 42042 79756 
48774 12694 76127 28073 
21857 37802 58296 47137 
19316 56156 87384 11918 
76643 94786 11978 86889 


61847 12183 97745 46147 
77293 54478 34117 65816 
99468 29617 46336 98378 
15785 65246 62598 14592 
23364 76634 26674 71964 
34929 91921 79261 33229 
56611 88892 53452 57785 
82636 47769 88923 22461 
48172 33355 11889 89633 


22663 73827 41767 89437 
48859 81989 79388 92192 
78232 38065 83826 60281 
99945 14346 95093 78849 
56577 90291 08442 37768 
67128 47773 64939 11615 
31784 02554 16174 44920 
80491 2941837551 23374 
14316 56132 22215 65553 

42413 98197 47496 13414 
31168 37265 85672 41959 
58236 81671 32824 32873 
19396 62832 53109 64765 
93549 16344 64948 25341 
75624 76788 1158 3. 67136 
67771 53556 98331 76698 
24957 24929 26217 98527 
80882 49413 79755 89282 
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7 

8 

8 
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2 

2 

2 

1 
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9 
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9 
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2 

5 

6 

5 
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6 

6 

1 

2 

7 

8 

9 

6 

5 

6 

1 

4 

8 

7 

1 

3 3 

8 

1 

3 

4 8 8 7 9 

9 

1 

4 

8 

5 

9 

8 

3 8 

4 

2 

3 

4 

4 8 

6 

7 9 
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7 

1 

5 

6 

5 

6 

3 

6 

1 

8 

6 

7 8 3 

6 

6 

8 

4 

2 

9 

2 

7 4 7 
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3 

1 

4 8 
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1 

5 
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5 
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9 
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7 
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3 

4 

1 
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4 

9 

6 

7 

2 

4 

4 6 8 

1 

5 

2 

3 

9 

3 

1 

4 

8 

6 

8 

3 

2 

8 

6 

2 

4 

4 

6 5 

7 8 

6 4 9 

6 

8 

5 

1 

1 

8 

3 

6 

4 7 6 3 

2 

7 8 

4 

6 

6 

4 

5 

6 

9 

8 5 2 5 6 

1 

9 2 

1 

1 

3 

5 


3 

2 

1 

9 4 4 9 

3 3 

1 

3 9 

8 3 7 5 

8 

3 7 6 2 2 


7 4 8 2 4 2 6 3 1 6 6 9 9 6 7 

6 6 9 4 5 9 3 6 6 1 3 5 1 5 3 

4 1 6 1 1 1 2 1 9 4 2 4 2 2 8 

1 9 4 5 2 5 7 9 7 5 4 7 8 1 5 

2 5 2 9 9 7 1 7 8 2 8 8 6 7 9 

327 7 7 64843 9258 1 
9318885428 71742 
68563 38259 16396 
87336 495 3 7 5 3 434 

32381 21148 97297 
6 9 9 1 9 1 5 4 6 6 3 6 8 6 0 
27778 77622 55159 
862 5 7 882 5 5 7 2 928 

5 1435 9 3913 14312 
93122 54874 29581 
1 5 0 4 4 0 2 3 3 1 6 3 6 4 5 
48893 46597 48774 
74566 39789 81433 

39863 86801 2 6 531 
94399 65615 71385 
16212 98532 12463 
77747 54358 44124 
43585 19779 87859 
82151 47924 59642 
25974 31147 93998 
68628 7348338217 
51436 22266 65776 

62716 48346 22675 
19143 19222 98313 
47931 84677 46781 
33229 37769 79698 
76682 95118 67862 
56864 21884 53046 
286 5 7 76693 84 1 3 7 
81498 52455 31229 
94375 63931 15454 

38247 84138 71165 
93856 77347 82872 
47919 62519 34731 
75785 95794 15923 
52168 11983 99488 
19382 33622 27299 
81471 49461 58617 
26624 58256 66356 

6 4 5 9 3 2 6 S 7 5 4 3 6 4 4 


9 9 2 4 2 

2 6 8 3 7 
1 7 7 9 8 

5 2 5 2 3 

3 4 114 

7 3 3 7 6 

6 8 4 8 0 

8 1 6 5 0 

4 5 9 0 1 

7 2 8 4 8 

9 8 7 9 4 

5 12 6 3 

8 5 4 2 6 

1 6 9 7 5 

4 4 112 

3 9 6 S 1 

0 3 5 5 7 

2 7 3 3 9 

1 2 7 7 2 

3 6 3 9 6 

8 3 5 2 7 

5 9 2 1 3 

6 4 9 5 9 

7 5 8 3 4 

4 114 1 

9 7 4 6 8 

2 8 6 8 5 

7 6 3 5 6 

5 4 12 7 

4 3 7 1 8 

3 8 5 9 9 

8 2 8 6 5 

6 14 8 4 

9 9 6 3 2 

1 7 2 7 3 

2 5 9 4 1 

4 4 7 2 2 

2 9 14 7 

8 2 8 9 8 
6 7 2 1 3 

6 15 8 6 

7 3 3 5 5 

9 5 6 3 4 
1 8 4 6 1 

3 6 9 7 9 
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4 2 6 5 9 

6 8 13 1 

3 6 4 7 8 

1 4 8 2 4 

7 9 9 6 6 

9 5 3 1 3 

2 15 8 5 

5 3 7 9 7 

8 7 '2 4 2 

16383 72679 
2289825526 

8 7 7 6 1 9 7 9 5 3 

35614 36891 

9 3 2 7 5 4 1 7 3 7 

4494618212 
7 8 1 6 7 5 4 3 6 8 

6 1 4 3 9 8 9 4 4 5 

5 9 5 2 2 6 3 1 8 4 

4 9 6 6 1 1 1 8 3 1 

53196 68259 
86857 75113 
37442 22945 
08339 49672 
24714 33386 
15928 96568 
71583 54794 
62275 87427 

31727 54363 
22588 96555 
15869 22124 
63276 85881 
98414 73738 
57633 41279 
46155 17446 
84991 39692 
79342 68917 

92495 48448 
11813 86599 
39557 24933 
86672 69156 
75339 73687 
24921 97224 
43784 51812 
68166 35771 
57248 12365 


7 4 6 4 3 2 1 2 2 4 

2 1 8 3 9 5 9 6 5 9 

9227874478 
35982 63737 

5 7 5 6 1 8 6 3 9 3 

1 6 4 1 5 4 2 8 8 1 

4 3 1 5 4 1 6 5 4 5 

6 9 7 2 7 3 8 1 1 2 

8 8 3 9 6 9 7 9 6 6 

7 8 1 6 5 1 1 4 4 8 

3 4 8 5 1 4 8 7 2 1 

43779 55296 
15414 87175 
91937 24934 

2 2 6 8 2 7 9 3 6 3 

6 9 3 2 6 3 6 5 8 7 

6 6 2 9 3 9 2 6 1 2 

87548 63859 

37549 97499 
05492 28503 
73927 69736 
91338 12117 
86783 81928 
29114 36371 
14661 55252 
58875 74885 
42256 43644 

98644 86696 
31488 39317 
49991 13468 
75722 45251 
64539 57729 
52277 94144 
13115 68983 
86853 21575 
27366 72832 

19485 27965 
27677 68698 
81923 76577 
96531 11751 
68254 34146 
42748 83432 
73399 99219 
55116 52383 
34862 45824 
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) (Gontimed) 
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6 8 2 7 8 

7 4 6 4 2 

6 

5 8 12 6 

5 4 111 

1 

7 3 7 5 7 

6 7 4 4 9 

3 

8 4 6 7 4 

2 8 3 9 2 

8 

1 2 5 6 5 

7 2 

9 7 6 

4 

3 6 2 9 9 

4 6 

5 2 7 

7 

2 13 3 1 

1 9 

2 6 3 

2 

6 7 4 4 8 

3 3 

8 5 5 

9 

4 5 9 1 2 

8 5 

7 3 8 

5 

9 9 8 8 3 

0 1 

6 8 4 

6 

9 8 7 3 4 

3 8 

2 13 

3 

2 2 2 2 9 

1 4 

8 6 2 

2 

6 7 8 6 7 

2 5 

9 5 7 

1 

8 3 4 5 8 

9 3 

4 2 8 

5 

5 9 5 9 2 

6 2 

5 7 5 

6 

4 6 9 7 1 

7 7 

6 9 6 

7 

3 1113 

8 9 

7 3 4 

4 

1 5 6 8 6 

4 6 3 8 9 

8 

7 4 3 4 5 

5 1 

1 4 1 

9 


TABLE 15»6 Peemutatiohs 

1 4 9 7 8 
9 6 4 4 2 

7 5 18 4 

3 7 5 5 6 
6 17 13 

4 3 3 2 7 

8 2 6 3 5 

2 8 8 9 9 

5 9 2 6 1 


15,5 


9 3 2 3 

4 2 5 4 

15 6 6 

6 1 3 2 

3 7 7 1 

8 4 8 8 

2 6 9 5 

7 8 4 7 

5 9 19 

5 8 17 

9 4 3 8 
2 6 8 3 

4 7 7 4 

7 9 9 1 

6 2 5 6 
13 6 5 

8 12 9 

3 5 4 2 

5 6 8 8 
17 9 6 

3 8 11 
2 3 7 7 

9 9 6 9 

4 2 5 5 

7 4 3 2 

8 5 4 4 

6 12 3 

2 17 3 

7 3 3 4 

9 5 9 2 

4 2 4 7 
6 4 8 1 

3 8 5 6 

8 6 6 8 
17 19 

5 9 2 5 

5 3 2 6 

8 9 8 4 

4 118 

16 7 2 

9 7 3 7 
2 2 6 1 
3 5 4 9 

6 4 9 5 

7 8 5 3 



00 Oi 


15.5 


2 3 17 1 
9 6 3 3 
4 5 5 8 

1 6 8 17 
4 12 4 9 

6 2 7 8 6 

3 5 4 9 2 
9 8 3 6 5 

7 7 9 2 4 

9 9 6 8 1 

1 7 9 5 6 

7 3 2 7 9 

8 5 7 4 4 

4 8 8 6 8 

6 14 9 7 

2 6 13 2 

3 2 5 1 3 

5 4 3 2 5 

9 1 7 8 7 

4 8 9 6 3 

7 4 2 2 2 

5 9 1 3 4 
3 3 3 7 5 
22 5 1 1 

1 6 8 5 6 

8 5 6 4 9 

6 7 4 9 8 

6 3 8 7 9 
5 4 6 4 8 

9 8 5 3 7 
12 19 4 

3 7 2 6 1 

4 9 3 1 6 
8 5 4 5 5 

7 6 7 2 3 

2 19 8 2 
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TABLE 15.6 Permutations of 9 {Contmued) 


4 2 

4 3 1 

9 2 111 

4 6 9 3 5 

3 6 2 5 5 

1 14 1 2 

7 14 5 3 

3 4 

6 8 6 

4 18 6 6 

8 12 7 9 

15 12 8 

6 6 3 4 5 

6 6 14 6 

6 1 

9 1 5 

5 6 9 4 7 

6 3 5 4 7 

5 17 7 2 

3 8 15 4 

3 2 2 8 7 

2 9 

2 5 3 

8 9 4 7 9 

7 4 7 2 2 

9 4 9 9 3 

9 5 9 2 1 

9 8 6 7 4 

5 5 

3 6 9 

7 8 6 3 8 

1 8 4 9 6 

6 2 8 1 7 

4 7 8 3 7 

8 3 3 9 8 

8 3 

7 4 7 

2 5 7 5 3 

3 2 6 8 8 

2 8 5 3 9 

2 2 2 9 6 

1 7 8 2 2 

1 7 

8 7 4 

3 4 2 8 5 

5 9 8 6 4 

8 3 3 8 4 

5 4 6 8 3 

4 9 5 6 9 

7 6 

5 2 2 

1 7 5 9 2 

2 7 15 3 

7 9 4 6 6 

8 3 7 7 S 

6 4 9 3 1 

9 8 

1 9 8 

6 3 3 2 4 

9 5 3 1 1 

4 7 6 4 1 

7 9 6 6 9 

2 5 7 1 5 

9 5 

1 6 2 

8 9 5 8 9 

7 7 4 9 9 

4 9 5 4 6 

6 9 5 5 2 

6 8 9 7 1 

1 8 

7 5 1 

117 7 8 

2 4 7 4 7 

9 5 12 2 

16 1 9 8 

8 6 6 1 8 

4 1 

2 4 3 

3 6 13 3 

3 3 8 6 1 

5 6 3 5 5 

4 4 4 2 9 

6 3 13 7 

5 9 

4 1 4 

7 2 3 4 6 

5 9 2 2 6 

3 8 8 1 1 

7 8 7 1 4 

4 18 9 9 

3 2 

3 9 7 

6 7 2 5 1 

6 6 3 8 5 

2 14 3 3 

3 7 6 7 7 

1 9 7 6 2 

8 3 

9 8 6 

5 8 6 1 2 

8 2 5 1 2 

1 4 2 9 8 

9 2 2 8 3 

2 7 2 4 5 

6 7 

5 3 5 

2 5 4 6 7 

116 5 3 

6 7 7 8 9 

8 3 36 5 

3 5 3 8 3 

7 6 

6 2 8 

9 3 8 2 5 

9 8 13 8 

8 3 6 7 4 

6 19 3 6 

7 4 4 2 6 

2 4 

8 7 9 

4 4 9 9 4 

4 5 9 7 4 

7 2 9 6 7 

2 5 8 4 1 

9 2 5 5 4 

9 8 

6 14 

2 9 17 7 

4 8 4 8 6 

8 2 3 8 2 

8 2 3 2 6 

7 3 8 7 5 

3 7 

9 2 1 

9 5 9 4 4 

9 4 3 4 3 

7 3 5 7 6 

5 9 4 6 8 

5 8 4 6 6 

6 3 

3 6 9 

3 1 5 2 1 

6 9 9 9 7 

5 6 2 6 9 

2 6 6 8 1 

8 6 5 1 8 

19 

7 5 3 

6 2 8 6 5 

1 7 8 7 9 

1 4 4 9 8 

9 17 3 9 

1 9 6 6 2 

8 4 

2 7 6 

1 6 3 5 6 

3 6 1 6 1 

9 1 1 5 5 

6 7 9 4 4 

3 13 8 7 

7 5 8 8 8 

5 7 2 8 3 

8 3 6 3 8 

4 7 6 4 3 

1 3 5 I 3 

9 2 1 4 9 

2 2 

5 3 5 

8 3 4 9 8 

2 5 2 5 4 

2 5 7 3 7 

4 5 15 2 

6 4 2 2 1 

4 6 4 4 7 

7 8 6 3 2 

5 15 15 

6 8 9 1 4 

3 4 2 9 5 

4 7 7 9 3 

5 1 

1 9 2 

4 4 7 1 9 

7 2 7 2 2 

3 9 8 2 1 

7 8 8 7 7 

2 6 9 3 4 

5 19 8 5 

5 18 5 4 

112 2 4 




7 4 2 4 7 

9 3 6 8 9 

6 2 16 8 




0 7 8 5 6 

6 9 4 1 1 

7 5 5 3 3 




8 6 

1 1 8 

7 5 3 7 8 

2 9 9 5 9 




1 8 

6 6 3 

4 8 9 4 5 

8 4 8 4 5 




3 5 

4 2 2 

8 7 7 6 2 

9 7 7 8 2 




9 9 

7 9 9 

3 4 5 2 7 

3 8 3 1 7 




2 3 

3 7 1 

2 6 13 3 

4 3 6 7 1 




4 2 

5 3 4 

1 2 2 9 6 

6 6 4 9 6 





15.5 
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TABLE i5.7 

Permutations OF 16 
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0 

0 

3 

11 

11 

12 

9 

5 

1 

6 

15 

3 

5 

3 

10 

2 

8 

10 

6 

4 

8 

2 

15 

8 

7 

13 

13 

12 

. '5- 

14 

11 

15 

14 

8 

9 

13 

13 

5 

8 

2 

3 

6 

13 

16 

(j 

5 

15 

16 

4 

8 

7 

13 

11 

5 

6 

12 

12 

4 

9 

1 

9 

13 

14 

15 

13 

2 

3 

8 

16 

7 

5 

12 

12 

2 

4 

10 

4 

5 

3 

3 

5 

11 

12 

15 

12 

14 

12 

6 

15 

10 

7 

7 

9 

8 

10 

6 

6 

12 

11 

12 

11 

3 

13 

3 

3 

2 

10 

2 

2 

15 

15 

8 

11 

5 

12 

14 

3 

8 

15 

16 

9 

7 

4 

14 

13 

8 

15 

9 

3 

9 

'5 ■ 

14 

10 

13 

8 

16 

: 7 ' 

6 

1 

10 

16 

2 

6 

16 

9 

1 

1 

5 

13 

12 

8 

2 

7 

10 

6 

13 

16 

13 

13 

11 

12 

9 

7 

7 

4 

12 

5 

12 

10 

14 

12 

1 

1 

2 

5 

4 

11 

11 

8 

13 

3 

6 

10 

10 

5 

16 

10 

.8 

4 

7 

1 

11 

15 

1 

3 

11 

15 

15 

14 

15 

1 

8 

15 

9 

11 

3 

3 

1 

7 

1 

10 

10 

8 

7 

14 

15 

12 

16 

10 

6 

10 

12 

1 

8 

7 

4 

13 

10 

9 

2 

14 

6 

16 

4 

7 

1 

13 

7 

7 

8 

■ ■ 2: 

1 

16 

6 

6 

13 

n 

7 

16 

6 

3 

16 

2 

11 

I 

7 

2 

10 

9 

1 

8 

13 

3 

1 

10 

7 

6 

4 

8 

8 

11 

12 

6 

9 

16 

9 

8 

■■ 2 ■ 

4 

7 

14 

15 

11 

13 

S 

11 

14 

3 

12 

3 

16 

13 

5 

16 

2 

5 

14 

1 

12 

2 

6 

14 

2 

12 

1 

15 

2 

14 

1 

11 

4 

5 

3 

3 

9 

8 

5 

9 

16 

9 

4 

10 

14 

4 

14 

6 

8 

16 

6 

16 

'2 

4 

12 

6 

11 

2 

10 

14 

5 

5 

15 

4 

8 

11 

2 

9 

7 

11 

5 

5 

0 

9 

14 

12 

15 

12 

9 

4 

2 

14 

7 

5 

9 


15 

10 

4 

15 

11 

13 

9 

15 

13 

15 

13 

3 

1 

3 

6 

4 

13 

16 

5 

5 

16 

5 

9 

13 

14 

4 

13 

3 

4 

14 

4 


TABLES OF RANDOM PERMUTATIONS 
TABLE 15.7 Permutations op 16 {Continued) 


5 

3 

6 

7 

2 

8 

9 

13 

4 

7 

10 

14 

8 

9 

9 

9 

11 

16 

16 

3 

6 

10 

13 

4 

4' 

14 

1 

12 

10 

11 

3 

4 

8 

8 

6 

12 

7 

5 

9 

13 

3 

12 

16 

10 

5 

7 

1 

1 

9 

5 

16 

1 

7 

10 

7 

11 

1 

15 

10 

14 

2 

13 

2 

11 

2 

15 

15 

2 

7 

13 

3 

4 

16 

16 

9 

9 

10 

2 

S 

6 

12 

13 

15 

11 

16 

8 

3 

6 

3 

11 

15 

3 

9 

12 

8 

2 

12 

2 

5 

10 

6 

10 

8 

11 

14 

6 

13 

10 

7 

5 

14 

13 

2 

13 

1 

8 

13 

3 

15 

15 

2 

9 

9 

n 

12 

4 

5 

6 

3 

12 

14 

8 

10 

14 

5 

5 

8 

5 

12 

8 

10 

13 

9 

16 

2 

6 

16 

2 

11 

15 

5 

3 

13 

,7 

11 

11 

1 

16 

3 

5 

8 

13 

■' 4' ■ 

1 

2 

12 

16 

2 

3 

14 

11 

6 

14 

1 

15 

7 

1 

6 

4 

10 

6 

15 

16 

11 

15 

14 

13 

10 

6 

15 

7 

4 

4 

14 

9 

5 

16 

6 

3 

12 

14 

10 

12 

: 2 

10 

7 

4 

3 

2 

8 

9 

16 

9 

6 

11 

12 

3 

7 

16 

8 

9 

9 

4 

13 

S 

8 

10 

7 

11 

1 

15 

1 

11 

14 

1 

14 

14 

'.2 ■ 

7 

8 

14 

10 

1 

‘ ' .5 

1 

13 

12 

16 

11 

9 

10 

14 

3 

7 

12 

2 

5 

13 

15 

7 

13 

12 

15 

5 

4 

4 

12 

15 

12 

16 

8 

7 

13 

16 

6 

3 

1 

11 

15 

5 

11 

4 

6 

1 

14 

4 

16 

7 

4 

3 

0 

4 

15 

5 

4 


12 

1 

W 

6 

1 

2 

15 

6 

2 

11 

6 

14 

1 

9 

10 

9 

5 

5 

8 

8 

6 

7 

15 

• 14 

15 

■7 

8 

11 

15 

14 

7 

7 

15 

13 

3 

4 

2 

11 

13 

7 

9 

1 

3 

12 

5 

11 

9 

14 

12 

11 

10 

12 

7 

12 

16 

14 

7 

15 

1 

8 

14 

4 

6 

8 

14 

S 

13 

3 

15 

2 

8 

12 

10 

9 

12 

14 

9 

4 

5 

12 

5 

13 

16 

4 

3 

1 

10 

3 

8 

10 

1 

2 

14 

4 

10 

10 

13 

1 

10 

6 

2 

12 

16 

14 

3 

2 

9 

11 

7 

4 

14 

13 

15 

IS 

7 

16 

15 

2 

12 

3 

11 

12 

12 

4' 

11 

■2 

15 

6 

14 

7 

8 

6 

16 

1 

15 

13 

16 

8 

7 

10 

13 

1 

7 

1 

10 

5 

16 

16 

16 

10 

16 

2 

3 

8 

2 

11 

13 

1 

13 

15 

10 

12 

16 

4 

9 

14 

6 

1 

12 

8 

5 

11 

5 

16 

9 

2 

4 

9 

7 

16 

2 

7 

1 

6 

13 

2 

5 

11 

11 

3 

16 

12 

12 

8 

16 

9 

14 

0 

13 

10 

9 

6 

4 

5 

9 

6 

15 

13 

4 

12 

3 

4 

5 

1 

16 

4 

.5 

7 

16 

15 

16 

12 

4 

8 

8 

2 

7 

6 

4 

2 

10 

3 

10 

10 

1 

0 

4 . 

5 

4, 

5 

3 

n 

3 

13 

1 

1 

1 

14 

14 

11 

4 ■ 

9 

11 

2 

10 

15 

8 

S 

13 

16 

3 

15 

n 

7 

9 

15 

2 

10 

6 

7 

7 

14 

0 

2. 

5 

9 

13 

8 

14 

1 

5 

15 

3 

3 

9 

6 

9 

3 

13 

15 

5 

3 

1 

6 

6 

5 

12 

13 

2 

11 

4 

6 

10 

12 

5 

11 

14 

6 

u 

8 

15 

3 

5 

16 

1 

8 

13 

8 

15 

4 

10 

8 

15 

4 

15 

16 

8 

3 

8 

14 

16 

14 

15 

8 

7 

4 

6 

8 

14 

13 

9 

6 

14 

14 

15 

16 

16 

6 

13 

14 

4 

13 

14 

4 

10 

13 

7 

2 

8 

4 

10 

11 

11 

U 

6 

; 4 

3 

16 

10 

11 

10 

0 

15 

16 

3 

5 

5 

12 

16 

16 

12 

13 

6 

5 

6 

16 

15 

13 

3 

6 

10 

11 

7 

11 

11 

12 

16 

13 

. , 7' 

1 

7 

9 

13 

10 

14 

7 

3 

8 

0 

10 

3 

6 

1 

13 

7 

5 

3 

2 

9 

1 

10 

4 

14 

4 

11 

8 

1 

y 

7 

12 

8 

4 

12 

13 

6 

15 

14 

2 

5 

2 

3 

15 

15 

12 

1 

13 

8 

4 

'2 

3 

2' 

6 

10 

4 

7 

12 

3 

3 

1 

10 

1 

13 

3 

0 

16 

12 

11 

13 

2 

15 

2 

1 

7 

1 

1 

7 

8 

13 

2 

11 

1 

6 

7 

7 

14 

6 

6 

1 

12 

3 

11 

14 

6 

11 

5 

16 

14 

7 

■ 2' 

5 

10 


6 

4 

'2 ■ 

1 

15 

■. 5 

1 

16 

2 

5 

9 

8 

10 

3 

14 

12 

7 

4 

5 

2 

1 

3 

14 

11 

10 

9 

15 

11 

11 

4 

15 

2 

12 

12 

9 

10 

1 

15 

10 

2 

5 

16 

14 

7 

10 

9 

1 

12 

8 

6 

12 

8 

1 

5 

2 

5 

11 

12 

11 

12 

5 

5 

16 

4 

6 

3 

14 

5 

13 

6 

7 

n 

13 

12 

9 

4 

9 

3 

13 

9 

9 

14 

14 

3 

15 

2 

3 

10 

9 

4 

9 

8 

9 

8 

12 

15 

5 

15 

10 

12 

9 

16 

8 

9 

10 

13 

16 

4 

6 

15 

7 

11 


0 

1 

5. 

8 

lo 

4 

9 

7 

. 6 

8 

8 

13 

11 

8 

0 

2 

9 

11 

15 

12 

15 ■ 

6 

13 

6 

15 

3 

1 

15 

3 

6 

10 

4 

15 

9 

9 

13 

2 

6 

15 

8 

■ '7 

7 

12 

16 

■ 2 ■ 

6 

5 

7 

9 

2 

9 

12 

2 

13 

' ■ ' 4' 

16 

7 

13 

3 

0 

1 

5 

1 

5 

11 

3 

10 

1 

11 

10 

■■ 1 

1 

14 

6 

15 

10 

14 

7 

13 


8 

13 

11 

2 

1 

14 

7 

16 

8 

3 

13 

11 

10 

7 

8 

11 


2' ■ 

14 ■ 

9 

2- 

10 

■ 6^ 

11 

11 

12 

. 5 

14 

16 

16 

6 

12 

1 

0 

10 

9 

10 

12 

11 

9 

2 ■ 

1 

. ^2 

9 

11 

14 

13 

4 

8 

16 

9 

16 

1 

4 

5 

14 

6 

3 

2 

4 

7 

14 

5 

10 

3 

2 

1 

. 4 
12 

12 

3 

7 

1(5 

11 

14 

12 

3 

6 

6 

3 

■ 7' 

5 

14 

'v'l 

12 

2 

3 

8 ■ 

: '15 

13 

13 

2 

14 

8 

11 

4 

4 

14 

4 

12 

2 

2 

15 

9 

13 

6 

8 

1 

3 

3 

11 

3 

10 

10 

11 

3 

12 

9 

10 

13 

11 

3 

7 

13 

7 

7 

7 

10 

14 

6 

13 

1 

16 

■ '7- 

10 

16 

16 

8 

15 

10 

4 

5 

5 

15 

4 

16 

14 

■ ■ ■■ 1 

9 

6 

4 

6 

16 

12 

15 

14 

4 

6 

3 

8 

6 

9 

10 

13 

11 

8 

4 

1 

■2 ■' 

15 

15 

13 

’ ’■7 ■ ” 

1 

15 

14 

12 

. 2 ■ 

7 

16 

10 

12 

15 

8 

5 

16 

5 

12 

16 

10 

10 

5 

8 

13 

13 

4 

13 

15 

i 

11 

4 

16 

O 

6 

6 

8 

4 

3 

3 

14 

8 

7 

10 

12 

9 

16 

12 

15 

14 

11 

5 

14 

9 

2 

4 
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TABLE 16.7 Permutations of 16 {ConiinueS) 


15 

2 

11 

8 

9 

2 

11 

15 

3 

1 

15 

16 

1 

2 

1 

3 

12 

1 

4 

3 

3 

6 

3 

2 

7 

3 

4 

9 

8 

4 

4 

13 

3 

4 

5 

6 

10 

16 

1 

8 

4 

6 

9 

4 

12 

11 

9 

6 

12 

13 

7 

2 

9 

15 

15 

14 

11 

8 

6 

5 

5 

13 

5 

11 

11 

10 

3 

14 

7 

16 

1 

15 

4 

12 

9 

9 

8 

12 

15 

2 

10 

8 

1 

10 

16 

14 

10 

10 

16 

2 

9 

11 

7 

1 

13 

4 

16 

9 

13 

12 

12 

16 

4 

7 

14 

13 

16 

11 

14 

7 

16 

3 

14 

7 

6 

10 

1 

13 

7 

16 

9 

9 

8 

12 

I 

15 

2 

4 

1 

12 

3 

12 

13 

16 

16 

1 

14 

15 

2 

11 

14 

10 

7 

1 

10 

9 

1 

7 

10 

14 

5 

4 

10 

13 

7 

7 

9 

7 

3 

10 

8 

3 

6 

13 

15 

8 

7 

2 

13 

6 

13 

9 

5 

14 

4 

5 

15 

10 

12 

15 

7 

14 

16 

15 

13 

12 

6 

3 

6 

8 

8 

5 

15 

3 

8 

2 

7 

5 

8 

7 

2 

11 

2 

9 

6 

5 

13 

13 

,4 : 

3 

12 

6 

2 

8 

11 

13 

2 

2' 

10 

14 

11 

7 

14 

3 

8 

16 

12 

1 

11 

11 

6 

1 

8 

6 

2 

16 

5 

11 

16 

6 

1 

4 

13 

5 

3 

6 

5 

5 

9 

9 

10 

7 

16 

5 

10 

15 

6 

14 

9 

13 

6 

12 

15 

14 

4 

4 

14 

8 

5 

5 

11 

8 

6 

11 

14 

12 

13 

3 

. 5, 

9 

13 

1 

10 

16 

2 

7 

8 

16 

15 

15 

14 

14 

11 

10 

12 

11 

4 

4 

14 

1 

16 

15 

12 

6 

5 

1 

15 

12 

2 

10 

2 

10 

12 

9 

3 

8 

3 

6 

11 

4 


7 

1 

3 

9 

3 

12 

2 

1 

5 

6 

12 

12 

10 

2 

14 

15 

3 

10 

9 

2 

9 

14 

16 

14 

16 

10 

13 

4 

6 

1 

7 

15 

S 

15 

10 

4 

16 

12 

13 

14 

12 

7 

4 

5 

12 

2 

1 

12 

9 

11 

9 

9 

11 

6 

2 

9 

8 

1 

15 

4 

13 

3 

15 

16 

10 

7 

12 

9 

2 

5 

4 

2 

16 

4 

8 

8 

15 

13 

10 

8 

8 

4 

11 

12 

9 

15 

6 

5 

3 

13 

3 

4 

3 

14 

12 

2 

5 

6 

14 

10 

10 

13 

6 

8 

2 

5 

8 

3 

13 

7 

16 

10 

15 

8 

3 

10 

14 

2 

7 

3 

16 

8 

9 

13 

13 

4 

5 

2 

1 

9 

8 

5 

2 

7 

7 

16 

11 

14 

16 

13 

6 

U 

1 

11 

14 

16 

7 

13 

11 

3 

2 

6 

4 

16 

9 

5 

2 

11 

2 

12 

5 

16 

2 

2 

7 

8 

11 

15 

4 

10 

15 

14 

13 

11 

6 

3 

12 

3 

4' 

7 

15 

15 

14 

1 

3 

1 

14 

10 

16 

15 

6 

13 

9 

9 

1 

11 

6 

15 

11 

5 

3 

5 

■ 7 

3 

15 

6 

16 

11 

10 

8 

11 

8 

1 

13 

5 

1 

1 

5 

12 

9 

2 

9 

5 

7 

4 

13 

10 

16 

8 

16 

10 

3 

12 

12 

15 

; 7 

13 

8 

3 

16 

4 

10 

12 

6 

1 

3 

3 

14 

12 

12 

1 

7 

14 

10 

4 

13 

4 

16 

1 

16 

14 

12 

8 

13 

6 

11 

9 

6 

7 

14 

14 

1 

0 

5 

13 

14 

10 

4 

5 

1 

1 

2 

10 

10 

8 

14 

15 

7 

15 

: 2 ■: 

5 

16 

5 

1 

16 

6 

9 

7 

6 

6 

11 

G 

13 

4 

11 

9 

4 

8 

14 

4 

13 

11 

7 

3 

11 

12 

7 

9 

8 

11 


0 

13 

10 

7 

11 

11 

11 

.. 7 

11 

11 

16 

11 

15 

0 

2 

12 

4 

10 

10 

1 

6 

8 

11 

16 

8 

14 

G 

16 

13 

2 

14 

12 

14 

1 

7' . 

16 

7 

12 

8 

6 

15 

3 

12 

11 

14 

7 

15 

4 

12 

14 

8 

10 

13 

6 

5 

5 

9 

5 

1 

7 

7 

7 

" .5 

0 

15 

1 

2 

10 

4 

8 

5 

0 

2 

8 

3 

7 

13 

15 

4 

14 

2 

15 

1 

8 

16 

9 

8 

12 

16 

12 

13 

8 

8 

12 

10 

4 

2 

6 

14 

11 

13 

12 

2 

6 

7 

3 

13 

9 

9 

13 

2 

5 

1 

3 

9 

2 

1 

16 

2 

5 

16 

2 

13 

14 

6 

2 

9 

13 

5 

7 ■■ 

6 

4 

10 

5 

14 

8 

8 

8 

6 

13 

8 

11 

9 

13 

4 

13 

14 

8 

6 

16 

3 

7 

11 

14 

6 

1 

16 

1 

3 

4 

4 

10 

15 

■ .'4- 

13 

12 

3 

2 

3 

10 

7 

3 

16 

16 

16 

6 

6 

11 

9 

12 

3 

9 

7 

3 

9 

5 

.12 

5 

15 

4 

4. 

14 

3 

7 

1 

11 

10 

3 

5 

2 

14 

14 

4 

15 

1 

16 

10 

11 

S 

9 

11 

13 

12 

10 

4 

3 

11 

2 

7 

15 

12 

1 

3 

10 

12 

10 

7 

1 

2 

15 

15 

16 

4 

11 

12 

15 

5 

14 

12 

10 

5 

4 

6 

1 

5 

6 

5 

3 

7 

5 

1 

9 

7 

15 

11 

9 

14 

4 

15 

16 

10 

6 

16 

2 

2 

15 

4 

14 

14 

3 

9 

1 

5 

13 

16 

10 

12 

13 

11 

8 

11 

5 

8 

12 

3 

8 

1 

15 

10 

6 

12 

16 

6 

2 

13 

13 

15 

9 

16 

3 

1 

16 

14 

5 

10 

4 

16 

G 

1 

1 

10 

2 

9 

4 

S 

14 

3 

7 

13 

9 

3 

4 

12 

11 

10 

13 

11 

10 

1 

4 

12 

11 

5 

11 

16 

7 

14. 
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Missing data, 72 
balanced incomplete blocks, 325 
balanced lattice, 277 
confounded factorials, 202 
incomplete latiii squares, 374 
latin square, 110 
lattice squares, 357 
mathematical model, 72 
partially balanced lattices, 290 
randomized blocks, 98 
split-plot design, 227 

Number of figures, 54 

Objectives of experiment, 10 

Orthogonal, 57 
components, 397 

Permutations, random, 414 
construction, 414 
tables 

permutations of 9, 422 
permutations of 16, 428 
use, 414 

Plaid square designs 
statistical analysis and plans 
2 X 2 X (2®) factorial in an 8 X 8 
square, 251, 258 

2 X 2 X (2^) factorial in an 8 X 8 
square, 251, 258 . , 
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PL'iiining of experimentf?^ \) 
experiiiieiitai inateri:il, 31 
objectives, 10 
3:eplictitions, number of, 17 
selection of treatments, 11, 29 
Plans 

balanced incomplete blocks, 329 
index to plans, 271, 327 
completely randomized designs, 88 
confoumled factorials 
2" series, 157, 104, 208 
index to plans, 180 
3^^ series, 104, 190, 212 
index to plans, 186 
inix(Ml series, 174, 199, 214 
index to plans, 186, 188 
cubic lattice designs, 300 
factorial experiment.s, 146 
fractional replication, 189 
graecodatiri squares, 117, 120 
half-})laid squares, 256 
incomplete latin squares, 272, 379 
index, r < ^, 370 
index, r > t, 376 
^latin square designs, 103, 109 
lattice designs, 271 
balanced lattices, 304 
partially balanced dosigiivS, 304, 309 
lattice squarci designs, 300 
index of plans, 272 
plaki square desigiKs, 25S 
quasi-laiiu squarcvS, 252 
rectangular lattice designs, 311 
split-plot design 
BCD confounded, 238 
for 7 X 2 ^ factorial, 239 
simple, 218 

sub-unit treatments confounded, 237 
sub-unit treatments in iatin square, 
235 

sub-unit treatineiits in strips, 232 
systcmalic arrangement, 231 
Pooled error mean square 
series of experiments, 406 
Precision, 16 

Quasi-latin squares, 241 
efficiency,' 247 
randojnijsation, 241 


Quasi-latin squares, statistical antilysis 
and plans 

2'^ factorial in a 4 X 4 squim^, 242, 
252 ■ 

2‘^ factorial in an 8X8 square, 2-14, 

252 

2^ factorial in an 8X8 square, 2*14, 

253 

2^ factorial i!i an 8X8 squares, 245, 
253 

3'^ fjuttonal in a 9 X 9 s<piar(3, 245, 
254' 

S‘^ factorial in a 9 X 9 s{j[uare, 245, 
255 

4X2^ factorial in an 8 X 8 square, 
246,256 

treatments applied t.o compk^te rows, 
248, 250 

treatments applied to complete rows 
and columns, 251 

Eandomimtion 
function of randomization, 6 
more than 16 numbers, 416 
restricted, 8 

tables of random numbers 
9 numbers, 422 
16 numbers, 428 
tests of randomness, 416 
R-andoinizetl blocks, 94 
advant-ages, 94 
efficiency, 100 
missing data, 98 
randomization, 95 
standard error, 98 
statistical analysis, 95 
Rectangular lattices, 292 
see also Lattice designs 
Eegreission 

components of treatments, 132 
quiifbatic component, 409 
Replications, 1 

for prescribed limits of error, 23 
table, 24 

for tests of significance, 17 
tables, 20, 21 

fractional replication experiments, 189 
ill latin squares, 105 ' . 

in lattice- designs, 282 
single replication experinients, 188 
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Series of experimeiits 

all experiments, same design, 394 
analysis, initial steps in, 391 
experiments, unequal size, 401 
heterogeneity of experimental error 
variances, 399 

heterogeneity of interaction variance, 
396 

lattice experiments, series of, 405 
mathematical model, 395 
mimerieal example, 392 
orthogonal components, 397 
repetitions in space and time, 411 
summary, combined amdysis, 401 
treatments X places interactions test, 
407 

Simple effects, 122 
Split-plot designs, 218 
confounded designs with plots split, 
236 

repeated subdivision, 229 
standard errors, 230 
simple split-plot design, 218 
efficiency, 221 
example, numerical, 224 
experimental error, 219 
mathematical model, 219 
missing data, 227 
randomization, 222 
standard error, 223 
statistical analysis, 222 
sub-unit treatments 
confounded, 237 
in latiii squares, 234 
in strips, 231 

systematic arrangement, 230 
Standard error formulae 
balanced incomplete blocks, 318 
completely randomized blocks, 87 
cross-over, 115 
factorials, 138 
graeco-latin squares, 118 
half-plaid squares, 250 
incomplete block, 262, 264 
incomplete latin squares, 371 
latin squares, 106 
lattice squares, 348 
lattices 
balanced, 274 


Standard error formulae, lattices, cubic, 
302 

partially balanced, 282 
rectangular, 294 
quasi-latiii squares, 242 
randomized blocks, 95 
split plot, 222 
Standard errors, 17 
pooled error, 91 
rules for calcultiting, 64 
when variances heterogeneous, 91 
Statistical analysis, 39 
example, numerical, 49 
general method of analysis, 39 
analysis of variance, 47 
assumptions, 41 

effects of errors in assumptions, 83 
confidence limits, 44 
construction, 46 

estimation of treatment effects, 42 
mathematical model, 39 
tests of significance, 44 
Subdivision 
of error S.S., 69 
of treatments S.S., 55 
Sum of products, 76 

f-distribution 
approximate ^', 92 
errors different, 91 
Technique, 30 
Test of sigiiifieanee, 4, 44 
effects suggested by data, 67 
number of replications for, 20 
Treatments, 11 

comparison among means, 64, 71 
components, 56 
effects, estimating, 42 
linear functions, 56 
means, adjusted, 32 
selection of, 11, 29 
sum of squares 
incomplete subdivision, 59 
sample sets, 58 
single components, 56 
subdivision, 55 

Youdcn squares, see Incomplete latin 
squares 




